GithubHelp home page GithubHelp logo

project3_visualizations's Introduction

IronHack Logo

Guided Project: Visualizing Real World Data

Data Visualization of IMDb Movies
Archana Rao

Ironhack | Amsterdam | Data Analytics Bootcamp - 26-09-2020

Contents

  • Project Description
  • Questions & Hypotheses
  • Dataset
  • Database
  • Workflow
  • Organization
  • Links

Project Description

This project talks about the trends, behavior and analysis of various factors involved in movies found in the most popular website IMDb. (Internet Movie Database). Given that thousands of movies were produced and released each year, the success quotient is never stable as the trends keep changing. The focus of investigation in this project is mainly on the genre of movies, though the influence of IMDb score is also being analyzed.

Questions & Hypotheses

  1. Trends of different genres over the years - What kind of movies are liked by the audience? Are the genres with high IMDb scores highly profited too?

  2. What does a movie with a good IMDb score indicate? What are the other factors revolving around it?

Dataset

The dataset is taken from Kaggle which is in turn from the IMDb database itself. While there are lot of choices for movies datasets apart from IMDb, this one has information for movies released from 1916 till 2016

Workflow

Choose an interesting topic and gather a dataset with appropriate details and file format. Understand the data and come up with problem statements. Perform data wrangling/cleaning. Update ipynb / python file with interpretation using Pandas/python and visualization of data using Seaborn and Matplotlib plots. Present the data insights with storytelling in a slide deck and evaluate the deviation of facts from hypothesis with proper reasoning. Create a README file which describes about the project.

Organization

No Git repository is created exclusively for this project. The main deliverables are the python file , an updated README and the PPT. The .py /.ipynb files detailing about the analysis are pushed to the individual project folder of the class repository. Only the README file will be available in the data folder. Any raw files / image files, data file(dataset)and the presentation file(.ppt) is uploaded in the google drive folder.

Links

Presentation: https://drive.google.com/drive/folders/1RoaPxJ66y_ZHWJpA9vJjeauQ0uxfo3J5 Raw files: CSV file / dataset : https://drive.google.com/drive/folders/1N3u6K0tNsq2VbfoL3BSYDqfFJTjM0223

project3_visualizations's People

Contributors

archanarao011 avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.