Large-scale, Fast and Accurate Shot Boundary Detection through Spatio-temporal Convolutional Neural Networks

05/09/2017
by Ahmed Hassanien, et al.

Shot boundary detection (SBD) is an important pre-processing step for video manipulation. Here, each segment of frames is classified as a sharp transition, a gradual transition, or no transition. Current SBD techniques analyze hand-crafted features and attempt to optimize both detection accuracy and processing speed. However, the heavy computation of optical flow prevents this. To achieve this aim, we present an SBD technique based on spatio-temporal Convolutional Neural Networks (CNN). Since current datasets are not large enough to train an accurate SBD CNN, we present a new dataset containing more than 3.5 million frames of sharp and gradual transitions. The transitions are generated synthetically using image compositing models. Our dataset contains an additional 70,000 frames of important hard-negative no-transition examples. We perform the largest evaluation to date for one SBD algorithm, on real and synthetic data, containing more than 4.85 million frames. In comparison to the state of the art, we outperform it on gradual dissolve detection, achieve competitive performance on sharp transition detection, and produce a significant improvement on wipes. In addition, we are up to 11 times faster than the state of the art.
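
The abstract does not spell out the image compositing models used to synthesize transitions. As a rough illustration only, the sketch below shows how a dissolve and a wipe could be composited from two clips with NumPy. The function names `dissolve` and `wipe`, the linear alpha ramp, and the left-to-right wipe direction are all assumptions made for this example, not the authors' method.

```python
import numpy as np


def dissolve(shot_a, shot_b):
    """Cross-fade two clips of shape (frames, height, width, channels).

    Assumption: a linear alpha ramp from 0 (only shot A) to 1 (only shot B).
    """
    assert shot_a.shape == shot_b.shape
    n = shot_a.shape[0]
    alpha = np.linspace(0.0, 1.0, n).reshape(n, 1, 1, 1)
    # Standard alpha compositing, applied per frame via broadcasting.
    return (1.0 - alpha) * shot_a + alpha * shot_b


def wipe(shot_a, shot_b):
    """Left-to-right wipe: shot B replaces shot A column by column.

    Assumption: the wipe edge moves at constant speed; needs >= 2 frames.
    """
    assert shot_a.shape == shot_b.shape
    n, _, width, _ = shot_a.shape
    out = shot_a.copy()
    for t in range(n):
        edge = round(width * t / (n - 1))  # columns already showing shot B
        out[t, :, :edge] = shot_b[t, :, :edge]
    return out


# Toy clips: 5 frames of 2x4 RGB, all-black shot A and all-white shot B.
a = np.zeros((5, 2, 4, 3))
b = np.ones((5, 2, 4, 3))
fade = dissolve(a, b)
swipe = wipe(a, b)
```

In this toy setup the first frame of each result equals shot A and the last equals shot B. A sharp transition needs no compositing at all: it is simply the frames of one shot followed directly by the frames of the next.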

research
05/23/2017

Ridiculously Fast Shot Boundary Detection with Fully Convolutional Neural Networks

Shot boundary detection (SBD) is an important component of many video an...
research
08/13/2018

Fast Video Shot Transition Localization with Deep Structured Models

Detection of video shot transition is a crucial pre-processing step in v...
research
06/08/2019

TransNet: A deep network for fast detection of common shot transitions

Shot boundary detection (SBD) is an important first step in many video p...
research
10/14/2017

Video Classification With CNNs: Using The Codec As A Spatio-Temporal Activity Sensor

We investigate video classification via a two-stream convolutional neura...
research
05/25/2018

Unsupervised Learning for Large-Scale Fiber Detection and Tracking in Microscopic Material Images

Constructing 3D structures from serial section data is a long standing p...
research
03/20/2020

Fully Automated Hand Hygiene Monitoring in Operating Room using 3D Convolutional Neural Network

Hand hygiene is one of the most significant factors in preventing hospit...
