Analyzing movies to predict their commercial viability for producers

01/05/2021
by   Devendra Swami, et al.
0

Upon film premiere, a major form of speculation concerns the relative success of the film. This relativity is in particular regards to the film's original budget, as many a time have big-budget blockbusters been met with exceptional success as met with abject failure. So how does one predict the success of an upcoming film? In this paper, we explored a vast array of film data in an attempt to develop a model that could predict the expected return of an upcoming film. The approach to this development is as follows: First, we began with the MovieLens dataset having common movie attributes along with genome tags per each film. Genome tags give insight into what particular characteristics of the film are most salient. We then included additional features regarding film content, cast/crew, audience perception, budget, and earnings from TMDB, IMDB, and Metacritic websites. Next, we performed exploratory data analysis and engineered a wide range of new features capturing historical information for the available features. Thereafter, we used singular value decomposition (SVD) for dimensionality reduction of the high dimensional features (ex. genome tags). Finally, we built a Random Forest Classifier and performed hyper-parameter tuning to optimize for model accuracy. A future application of our model could be seen in the film industry, allowing production companies to better predict the expected return of their projects based on their envisioned outline for their production procedure, thereby allowing them to revise their plan in an attempt to achieve optimal returns.

READ FULL TEXT
research
08/22/2019

Song Hit Prediction: Predicting Billboard Hits Using Spotify Data

In this work, we attempt to solve the Hit Song Science problem, which ai...
research
08/15/2018

Folksonomication: Predicting Tags for Movies from Plot Synopses Using Emotion Flow Encoded Neural Network

Folksonomy of movies covers a wide range of heterogeneous information ab...
research
10/24/2018

Data-driven Blockbuster Planning on Online Movie Knowledge Library

In the era of big data, logistic planning can be made data-driven to tak...
research
04/03/2018

Predicting Gross Movie Revenue

'There is no terror in the bang, only is the anticipation of it' - Alfre...
research
10/13/2021

Presenting a Larger Up-to-date Movie Dataset and Investigating the Effects of Pre-released Attributes on Gross Revenue

Movie-making has become one of the most costly and risky endeavors in th...
research
01/10/2020

Measuring Women Representation and Impact in Films over Time

Women have always been underrepresented in movies and not until recently...

Please sign up or login with your details

Forgot password? Click here to reset