Movies2Scenes: Learning Scene Representations Using Movie Similarities

02/22/2022
by   Shixing Chen, et al.
0

Automatic understanding of movie-scenes is an important problem with multiple downstream applications including video-moderation, search and recommendation. The long-form nature of movies makes labeling of movie scenes a laborious task, which makes applying end-to-end supervised approaches for understanding movie-scenes a challenging problem. Directly applying state-of-the-art visual representations learned from large-scale image datasets for movie-scene understanding does not prove to be effective given the large gap between the two domains. To address these challenges, we propose a novel contrastive learning approach that uses commonly available sources of movie-information (e.g., genre, synopsis, more-like-this information) to learn a general-purpose scene-representation. Using a new dataset (MovieCL30K) with 30,340 movies, we demonstrate that our learned scene-representation surpasses existing state-of-the-art results on eleven downstream tasks from multiple datasets. To further show the effectiveness of our scene-representation, we introduce another new dataset (MCD) focused on large-scale video-moderation with 44,581 clips containing sex, violence, and drug-use activities covering 18,330 movies and TV episodes, and show strong gains over existing state-of-the-art approaches.

READ FULL TEXT

page 1

page 3

page 6

page 8

page 13

page 14

page 15

page 16

research
02/26/2019

Learning Latent Scene-Graph Representations for Referring Relationships

Understanding the semantics of complex visual scenes often requires anal...
research
10/20/2022

MovieCLIP: Visual Scene Recognition in Movies

Longform media such as movies have complex narrative structures, with ev...
research
06/04/2023

MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning

We introduce MoviePuzzle, a novel challenge that targets visual narrativ...
research
04/06/2020

A Local-to-Global Approach to Multi-modal Movie Scene Segmentation

Scene, as the crucial unit of storytelling in movies, contains complex a...
research
08/19/2020

Learning Trailer Moments in Full-Length Movies

A movie's key moments stand out of the screenplay to grab an audience's ...
research
12/14/2020

Movie Summarization via Sparse Graph Construction

We summarize full-length movies by creating shorter videos containing th...
research
04/28/2021

Shot Contrastive Self-Supervised Learning for Scene Boundary Detection

Scenes play a crucial role in breaking the storyline of movies and TV ep...

Please sign up or login with your details

Forgot password? Click here to reset