
lewah / mangosplit


Jupyter Notebook 99.89% TSQL 0.11%

mangosplit's Introduction

NETFLIX MOVIE DATA ANALYSIS PROJECT

The Netflix Movies and TV Shows dataset used in this project comes from Kaggle: https://www.kaggle.com/datasets/shivamb/netflix-shows

Create a conda environment:
conda create --name netflix
conda activate netflix

Example of creating an environment with pinned package versions:
conda create -n env-01 python=3.9 scipy=0.15.0 numpy
pip install --upgrade seaborn matplotlib

0. Ask Questions

Which movie has the highest country viewing/releases? (Done)
Which actors are most likely to work together?
What type of content is added over the months, for example in holiday seasons (December, July, January), and how much content is released each month?
Which countries have the largest quantity of released content? Group this by content type. What are the most common genres in the top 5 countries? Visualise the type of content produced by each country.
Explore the “age” of content on Netflix, i.e. the gap between when a movie/show is released and when it is added.
See how this content age varies per country.
Find out more about the movie and TV ratings; visualise TV vs. movies and group them by target audience, e.g. kids, young adult, teenagers, adults.
Visualise these ratings by country.
Movie and TV show genres, and the quantity of content released in each genre.
Group the genres by content type.
Netflix titles
Netflix descriptions
Splitting the date_added column (second link)
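
The date_added split and the content "age" question above could be sketched as follows. This is a minimal sketch, not the notebook's actual code: it assumes the Kaggle column names `date_added`, `release_year` and `country`, assumes dates are written like "September 25, 2021", and uses a hypothetical helper `add_date_features` with made-up sample rows.

```python
import pandas as pd

def add_date_features(df):
    """Split date_added into year/month and compute the content age in years.

    Hypothetical helper; the column names are assumed from the Kaggle dataset.
    """
    df = df.copy()
    # Assumed date format, e.g. "September 25, 2021"; unparseable values become NaT
    added = pd.to_datetime(
        df["date_added"].str.strip(), format="%B %d, %Y", errors="coerce"
    )
    df["year_added"] = added.dt.year
    df["month_added"] = added.dt.month_name()
    # Content age: years between release and being added to Netflix
    df["content_age"] = df["year_added"] - df["release_year"]
    return df

# Made-up sample rows, only to show the shape of the result
titles = pd.DataFrame({
    "title": ["a", "b", "c"],
    "country": ["India", "India", "Japan"],
    "date_added": ["December 1, 2020", "July 15, 2019", "December 20, 2021"],
    "release_year": [2018, 2019, 2011],
})

featured = add_date_features(titles)
# Number of titles added per month name
monthly = featured["month_added"].value_counts()
# Mean content age per country (multi-country strings are treated as one key here)
age_by_country = featured.groupby("country")["content_age"].mean()
print(monthly)
print(age_by_country)
```

In the real dataset the `country` field can list several countries in one string, so it may need to be split before grouping.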

Data cleaning

Data Exploration

How do the variables correlate?
What type of content has Netflix been focussing on over the years?
Movie and TV show duration
What are the top 10 genres on Netflix?
Find out more about the movie and TV ratings and group them by target audience, e.g. kids, young adult, teenagers, adults

Data Visualisation

Which countries have contributed the most movies in recent years?
What is the content release pattern at Netflix like?
What is the distribution of Netflix’s content by origin, or country?
What type of content has Netflix been focussing on over the years?
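
One way to approach the genre and country questions above is to split the comma-separated columns and count the individual entries. This is only a sketch under assumptions: `listed_in` (genres) and `country` are the Kaggle column names, `count_values` is a hypothetical helper, and the rows are made-up examples.

```python
import pandas as pd

def count_values(df, column, top_n=10):
    """Count individual entries in a comma-separated text column.

    Hypothetical helper: splits e.g. "Dramas, Comedies" into separate
    entries, strips whitespace, and returns the top_n most frequent ones.
    """
    values = df[column].dropna().str.split(",").explode().str.strip()
    values = values[values != ""]
    return values.value_counts().head(top_n)

# Made-up sample rows using the assumed Kaggle column names
titles = pd.DataFrame({
    "type": ["Movie", "Movie", "TV Show"],
    "listed_in": ["Dramas, International Movies", "Comedies, Dramas", "Kids' TV"],
    "country": ["United States, India", "India", None],
})

top_genres = count_values(titles, "listed_in")   # top 10 genres
country_counts = count_values(titles, "country")  # titles per country
print(top_genres)
print(country_counts)
```

The resulting counts can be passed straight to a bar plot, for example `top_genres.plot(kind="barh")`.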

1. Data collection

Data Preparation

After downloading the dataset, I load it into a DataFrame for the data cleaning process.
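
A minimal sketch of this loading step with pandas. `load_netflix` is a hypothetical helper, and the inline sample stands in for the downloaded CSV so that the snippet runs on its own; the column names are assumed from the Kaggle dataset.

```python
import io
import pandas as pd

def load_netflix(source):
    """Read the Netflix titles CSV from a file path or file-like object."""
    df = pd.read_csv(source)
    # Strip stray whitespace from the column names
    df.columns = df.columns.str.strip()
    return df

# Tiny inline sample so the sketch runs without the downloaded file
sample_csv = io.StringIO(
    "show_id,type,title,country,date_added,release_year\n"
    's1,Movie,Example Movie,United States,"September 25, 2021",2020\n'
    's2,TV Show,Example Show,,"September 24, 2021",2021\n'
)

df = load_netflix(sample_csv)
print(df.shape)         # rows and columns
print(df.isna().sum())  # missing values per column
```

With the real download, the call would look like `load_netflix("netflix_titles.csv")`, where the file name is an assumption about how the Kaggle CSV was saved.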

2. Data cleaning and processing

Fill in the NaN values in the dataset, making sure there aren't any NULL values left so that the data is consistent. Columns with null values include:
- rating
- date_added
- director
- cast
- country
- duration
Deleting redundant columns.
- Handling invalid values in the date_added column: some values in the date_added column are earlier than those in the release_year column (i.e. the year the movie was added is earlier than the year it was released)
- Drop such invalid rows to ensure data accuracy
Dropping duplicates.
Cleaning individual columns.
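
The cleaning steps above can be sketched roughly as follows. The fill strategy is an assumption, not necessarily what the notebook does: text columns get an "Unknown" placeholder, while rows missing rating, date_added or duration are dropped. `clean_netflix` is a hypothetical helper and the date format "September 25, 2021" is assumed.

```python
import pandas as pd

def clean_netflix(df):
    """Fill or drop missing values, remove invalid dates, drop duplicates."""
    df = df.copy()
    # Assumption: free-text columns get a placeholder instead of NaN
    df = df.fillna(value={"director": "Unknown", "cast": "Unknown", "country": "Unknown"})
    # Assumption: rows missing these fields are dropped rather than filled
    df = df.dropna(subset=["rating", "date_added", "duration"])
    # Parse date_added; values that do not match the assumed format become NaT
    df["date_added"] = pd.to_datetime(
        df["date_added"].str.strip(), format="%B %d, %Y", errors="coerce"
    )
    df = df.dropna(subset=["date_added"])
    # Keep only rows where the title was added in or after its release year
    df = df[df["date_added"].dt.year >= df["release_year"]]
    # Remove exact duplicate rows
    df = df.drop_duplicates()
    return df.reset_index(drop=True)

# Made-up rows: one duplicate, one with missing values, one invalid date
raw = pd.DataFrame({
    "title": ["A", "A", "B", "C"],
    "director": ["X", "X", None, "Z"],
    "cast": ["p", "p", "q", None],
    "country": ["India", "India", None, "Japan"],
    "date_added": ["September 25, 2021", "September 25, 2021",
                   " August 4, 2017", "January 1, 2015"],
    "release_year": [2020, 2020, 2016, 2019],
    "rating": ["PG", "PG", "TV-MA", "R"],
    "duration": ["90 min", "90 min", "2 Seasons", "100 min"],
})

clean = clean_netflix(raw)
print(clean[["title", "director", "country", "date_added"]])
```

Title "C" is removed because it was added in 2015 but released in 2019, and the repeated "A" row collapses to one.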

Data wrangling

Data Transformation

3. Exploratory analysis & Visualization

Links I've used for reference:
