Skip to content

arnav1803/Data-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Movie Dataset Analysis 🎬

This project analyzes a dataset of 10,000 movies from The Movie Database (TMDb), focusing on understanding the impact of various features like budget, genre, and runtime on movie popularity, revenue, and ratings.

Objectives 📌

Budget and Popularity: Determine if higher budgets lead to higher popularity. Top Revenue Features: Identify common features in the top 10 revenue-generating movies. Genre Trends: Analyze the most popular genres over the years. Length, Ratings, and Revenue: Explore how movie length correlates with ratings and revenue.

Summary of Findings 📝

Budgets and Popularity: While high budgets don’t guarantee popularity, they tend to result in higher average popularity. Revenue Insights: Adventure and Western genres have become increasingly popular, with longer movies often achieving higher revenues. Trends Over Time: Revenue and popularity patterns show that genre preferences evolve, with significant increases in average revenue since the 1960s.

Dataset 📂 The dataset includes information such as:

Features: Budget, revenue, popularity, genres, runtime, release date, and adjusted financial figures for inflation.

Key Libraries 📚

Pandas: For data manipulation and cleaning. Matplotlib & Seaborn: For visualizing trends in the dataset.

Conclusion 🏁

This analysis provides insights into how movie characteristics like budget, genre, and runtime influence popularity and revenue, revealing trends that could be useful for industry predictions.

About

Data analysis on TMDB Dataset

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published