
Exploratory Data Analysis (EDA) of the Netflix Movies and TV Shows dataset using Python, pandas, matplotlib, and seaborn.


BlladeRunner/netflix-eda-project


🎬 Netflix EDA Project

📌 Project Description

  • This project is an Exploratory Data Analysis (EDA) of the Netflix Movies and TV Shows dataset.
  • The goal is to explore the Netflix catalog and identify trends across years, countries, genres, ratings, and content duration.

🛠️ Tools & Libraries

  • Python 3.11
  • Pandas, NumPy: data analysis and cleaning
  • Matplotlib, Seaborn: data visualization

📊 Analysis Workflow

  • Load and inspect the dataset
  • Data cleaning and preprocessing (duplicates, missing values, parsing dates and duration)
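The cleaning steps above could look roughly like the sketch below. It is not the notebook's actual code: the column names (`director`, `date_added`, `duration`) are assumed to follow the Kaggle dataset layout, the rows are made up for illustration, and in practice the frame would come from `pd.read_csv` on the file in `data/`.

```python
import pandas as pd

# Made-up sample rows standing in for the real CSV (illustration only).
# Column names are an assumption based on the Kaggle dataset layout.
raw = pd.DataFrame({
    "title": ["Film A", "Film A", "Show B", "Film C"],
    "type": ["Movie", "Movie", "TV Show", "Movie"],
    "director": ["Jane Doe", "Jane Doe", None, None],
    "date_added": ["September 25, 2021", "September 25, 2021", " August 1, 2019", None],
    "duration": ["90 min", "90 min", "2 Seasons", "104 min"],
})


def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Drop duplicate rows, fill missing directors, parse dates and durations."""
    out = df.drop_duplicates().copy()
    out["director"] = out["director"].fillna("Unknown")
    # Dates are assumed to look like "September 25, 2021"; bad values become NaT.
    out["date_added"] = pd.to_datetime(
        out["date_added"].str.strip(), format="%B %d, %Y", errors="coerce"
    )
    out["year_added"] = out["date_added"].dt.year
    # "90 min" -> 90 and "2 Seasons" -> 2; the unit depends on the `type` column.
    digits = out["duration"].str.extract(r"(\d+)", expand=False)
    out["duration_value"] = pd.to_numeric(digits, errors="coerce").astype("Int64")
    return out.reset_index(drop=True)


cleaned = clean(raw)
print(cleaned[["title", "director", "year_added", "duration_value"]])
```

Keeping the numeric part of `duration` in its own column leaves the original string intact, so minutes and seasons can still be told apart via `type`.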

Exploration of:

  • Movies vs TV Shows distribution
  • Release years vs years added to Netflix
  • Movie durations and number of TV show seasons
  • Top countries by content production
  • Genres and categories distribution
  • Age ratings distribution
  • Top directors and actors
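Several of these explorations come down to counting values, and columns such as country or genre may hold several comma-separated entries per title. A minimal sketch of one way to handle that, assuming column names `type`, `country`, `listed_in`, and `rating` (taken from the Kaggle layout, not confirmed by this README) and using made-up rows; `count_multi` is a hypothetical helper:

```python
import pandas as pd

# Made-up sample rows (illustration only); column names are assumptions.
titles = pd.DataFrame({
    "type": ["Movie", "Movie", "TV Show", "Movie"],
    "country": ["United States, India", "India", "United States", None],
    "listed_in": [
        "Dramas, International Movies",
        "Comedies, International Movies",
        "TV Dramas",
        "Dramas",
    ],
    "rating": ["TV-MA", "TV-14", "TV-MA", "PG-13"],
})


def count_multi(series: pd.Series, top: int = 10) -> pd.Series:
    """Count individual entries in a comma-separated text column."""
    items = series.dropna().str.split(",").explode().str.strip()
    return items.value_counts().head(top)


type_share = titles["type"].value_counts(normalize=True)  # Movies vs TV Shows
top_countries = count_multi(titles["country"])            # per-country counts
top_genres = count_multi(titles["listed_in"])             # per-genre counts
rating_counts = titles["rating"].value_counts()           # age ratings

print(type_share, top_countries, top_genres, rating_counts, sep="\n\n")
```

The same helper would apply to directors and cast if those columns are also comma-separated. Note that a title listed under two countries is counted once for each.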

🔎 Key Insights

  • 📈 Netflix rapidly expanded its library between 2015 and 2020.
  • 🎥 Most movies are 80–120 minutes long.
  • 📺 The majority of TV shows have only 1 season.
  • 🌍 USA and India dominate Netflix content production.
  • 🎭 Most common categories include International Movies, Dramas, Comedies.
  • 🔞 A large share of Netflix content targets mature audiences (TV-MA, TV-14).
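Claims like these can be checked as simple shares once durations are numeric. The sketch below uses made-up rows, so its numbers say nothing about the real catalog. The column names and the `duration_value` helper column are assumptions, not necessarily what the notebook uses.

```python
import pandas as pd

# Made-up sample rows (illustration only); column names are assumptions.
catalog = pd.DataFrame({
    "type": ["Movie", "Movie", "Movie", "Movie", "TV Show", "TV Show", "TV Show"],
    "duration": ["95 min", "110 min", "150 min", "81 min",
                 "1 Season", "1 Season", "3 Seasons"],
    "rating": ["TV-MA", "TV-14", "PG", "TV-MA", "TV-MA", "TV-Y", "TV-14"],
})

# Numeric part of the duration: minutes for movies, seasons for TV shows.
catalog["duration_value"] = pd.to_numeric(
    catalog["duration"].str.extract(r"(\d+)", expand=False)
)

movies = catalog[catalog["type"] == "Movie"]
shows = catalog[catalog["type"] == "TV Show"]

# The mean of a boolean mask is the fraction of rows where it is True.
share_80_120 = movies["duration_value"].between(80, 120).mean()
share_one_season = (shows["duration_value"] == 1).mean()
share_mature = catalog["rating"].isin(["TV-MA", "TV-14"]).mean()

print(f"Movies lasting 80 to 120 minutes: {share_80_120:.0%}")
print(f"TV shows with one season: {share_one_season:.0%}")
print(f"Titles rated TV-MA or TV-14: {share_mature:.0%}")
```

`Series.between` is inclusive on both ends by default, so movies of exactly 80 or 120 minutes fall inside the range.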

📂 Project Structure

netflix_eda_project/
├─ data/                # dataset (optional, can be downloaded separately)
├─ netflix_eda.ipynb    # Jupyter Notebook with analysis
├─ README.md            # project description
├─ requirements.txt     # dependencies
└─ .gitignore           # ignore rules for Git

🔗 Dataset

The Netflix Movies and TV Shows dataset is available on Kaggle.

💼 Business Relevance

  • Understanding Netflix's content distribution helps identify strategic markets, content preferences, and opportunities for localized production.
  • This EDA can guide decisions for media acquisition and audience targeting.

