Unsupervised Extraction of Video Highlights Via Robust Recurrent Auto-encoders

10/06/2015
by   Huan Yang, et al.
0

With the growing popularity of short-form video sharing platforms such as Instagram and Vine, there has been an increasing need for techniques that automatically extract highlights from video. Whereas prior works have approached this problem with heuristic rules or supervised learning, we present an unsupervised learning approach that takes advantage of the abundance of user-edited videos on social media websites such as YouTube. Based on the idea that the most significant sub-events within a video class are commonly present among edited videos while less interesting ones appear less frequently, we identify the significant sub-events via a robust recurrent auto-encoder trained on a collection of user-edited videos queried for each particular class of interest. The auto-encoder is trained using a proposed shrinking exponential loss function that makes it robust to noise in the web-crawled training data, and is configured with bidirectional long short term memory (LSTM) LSTM:97 cells to better model the temporal structure of highlight segments. Different from supervised techniques, our method can infer highlights using only a set of downloaded edited videos, without also needing their pre-edited counterparts which are rarely available online. Extensive experiments indicate the promise of our proposed solution in this challenging unsupervised settin

READ FULL TEXT

page 1

page 8

research
09/07/2017

An unsupervised long short-term memory neural network for event detection in cell videos

We propose an automatic unsupervised cell event detection and classifica...
research
05/26/2016

Video Summarization with Long Short-term Memory

We propose a novel supervised learning technique for summarizing videos ...
research
02/12/2020

Constructing a Highlight Classifier with an Attention Based LSTM Neural Network

Data is being produced in larger quantities than ever before in human hi...
research
11/15/2016

Diversity encouraged learning of unsupervised LSTM ensemble for neural activity video prediction

Being able to predict the neural signal in the near future from the curr...
research
07/21/2017

Shallow reading with Deep Learning: Predicting popularity of online content using only its title

With the ever decreasing attention span of contemporary Internet users, ...
research
11/24/2017

For Your Eyes Only: Learning to Summarize First-Person Videos

With the increasing amount of video data, it is desirable to highlight o...
research
01/02/2018

Unsupervised Object-Level Video Summarization with Online Motion Auto-Encoder

Unsupervised video summarization plays an important role on digesting, b...

Please sign up or login with your details

Forgot password? Click here to reset