Moviescope: Large-scale Analysis of Movies using Multiple Modalities

08/08/2019
by Paola Cascante-Bonilla, et al.

Film media is a rich form of artistic expression. Unlike photography and short videos, movies contain storylines that are deliberately complex and intricate in order to engage their audience. In this paper we present a large-scale study comparing the effectiveness of visual, audio, text, and metadata-based features for predicting high-level information about movies, such as their genre or estimated budget. We demonstrate the usefulness of content-based methods in this domain, in contrast to human-based and metadata-based predictions, in the era of deep learning. Additionally, we provide a comprehensive study of temporal feature aggregation methods for representing video and text, and find that simple pooling operations are effective in this domain. We also show to what extent different modalities are complementary to each other. To this end, we introduce Moviescope, a new large-scale dataset of 5,000 movies with corresponding movie trailers (video + audio), movie posters (images), movie plots (text), and metadata.
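As a rough illustration of what "simple pooling operations" for temporal feature aggregation can look like, the sketch below collapses a sequence of per-frame feature vectors into one fixed-length vector per video. The abstract does not say which pooling operators were evaluated, so mean and max pooling are assumptions here, and the function names and toy values are hypothetical.

```python
from typing import List, Sequence


def mean_pool(frames: Sequence[Sequence[float]]) -> List[float]:
    """Average each feature dimension over the time axis."""
    if not frames:
        raise ValueError("need at least one frame")
    n = len(frames)
    # zip(*frames) groups the values of each feature dimension across frames.
    return [sum(column) / n for column in zip(*frames)]


def max_pool(frames: Sequence[Sequence[float]]) -> List[float]:
    """Take the maximum of each feature dimension over the time axis."""
    if not frames:
        raise ValueError("need at least one frame")
    return [max(column) for column in zip(*frames)]


# Toy input: 3 frames with 2-dimensional features (made-up values,
# standing in for per-frame visual or per-word text embeddings).
frame_features = [[1.0, 4.0], [2.0, 0.0], [3.0, 2.0]]

video_mean = mean_pool(frame_features)
video_max = max_pool(frame_features)
print(video_mean, video_max)
```

Either pooled vector has the same length regardless of how many frames the trailer contains, which is what lets a fixed-size classifier consume variable-length video or text.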
