Textually Customized Video Summaries

02/06/2017
by   Jinsoo Choi, et al.
0

The best summary of a long video differs among different people due to its highly subjective nature. Even for the same person, the best summary may change with time or mood. In this paper, we introduce the task of generating customized video summaries through simple text. First, we train a deep architecture to effectively learn semantic embeddings of video frames by leveraging the abundance of image-caption data via a progressive and residual manner. Given a user-specific text description, our algorithm is able to select semantically relevant video segments and produce a temporally aligned video summary. In order to evaluate our textually customized video summaries, we conduct experimental comparison with baseline methods that utilize ground-truth information. Despite the challenging baselines, our method still manages to show comparable or even exceeding performance. We also show that our method is able to generate semantically diverse video summaries by only utilizing the learned visual embeddings.

READ FULL TEXT

page 1

page 7

research
06/23/2014

VideoSET: Video Summary Evaluation through Text

In this paper we present VideoSET, a method for Video Summary Evaluation...
research
12/19/2017

On the Evaluation of Video Keyframe Summaries using User Ground Truth

Given the great interest in creating keyframe summaries from video, it i...
research
12/19/2017

Bipartite Graph Matching for Keyframe Summary Evaluation

A keyframe summary, or "static storyboard", is a collection of frames fr...
research
08/01/2018

From Thumbnails to Summaries - A single Deep Neural Network to Rule Them All

Video summaries come in many forms, from traditional single-image thumbn...
research
01/26/2019

Real-time Video Summarization on Commodity Hardware

We present a method for creating video summaries in real-time on commodi...
research
08/14/2022

TL;DW? Summarizing Instructional Videos with Task Relevance Cross-Modal Saliency

YouTube users looking for instructions for a specific task may spend a l...
research
01/27/2018

Understanding Deep Architectures by Interpretable Visual Summaries

A consistent body of research investigates the recurrent visual patterns...

Please sign up or login with your details

Forgot password? Click here to reset