Detecting Kissing Scenes in a Database of Hollywood Films

06/05/2019
by   Amir Ziai, et al.
0

Detecting scene types in a movie can be very useful for application such as video editing, ratings assignment, and personalization. We propose a system for detecting kissing scenes in a movie. This system consists of two components. The first component is a binary classifier that predicts a binary label (i.e. kissing or not) given a features exctracted from both the still frames and audio waves of a one-second segment. The second component aggregates the binary labels for contiguous non-overlapping segments into a set of kissing scenes. We experimented with a variety of 2D and 3D convolutional architectures such as ResNet, DesnseNet, and VGGish and developed a highly accurate kissing detector that achieves a validation F1 score of 0.95 on a diverse database of Hollywood films ranging many genres and spanning multiple decades. The code for this project is available at http://github.com/amirziai/kissing-detector.

READ FULL TEXT

page 4

page 5

research
10/20/2022

MovieCLIP: Visual Scene Recognition in Movies

Longform media such as movies have complex narrative structures, with ev...
research
12/29/2022

Efficient Movie Scene Detection using State-Space Transformers

The ability to distinguish between different movie scenes is critical fo...
research
03/20/2020

Detection in Crowded Scenes: One Proposal, Multiple Predictions

We propose a simple yet effective proposal-based object detector, aiming...
research
02/26/2021

Where to look at the movies : Analyzing visual attention to understand movie editing

In the process of making a movie, directors constantly care about where ...
research
12/14/2020

Movie Summarization via Sparse Graph Construction

We summarize full-length movies by creating shorter videos containing th...
research
03/24/2022

Bayesian Nonparametric Submodular Video Partition for Robust Anomaly Detection

Multiple-instance learning (MIL) provides an effective way to tackle the...
research
04/27/2023

Analogy-Forming Transformers for Few-Shot 3D Parsing

We present Analogical Networks, a model that encodes domain knowledge ex...

Please sign up or login with your details

Forgot password? Click here to reset