A Faster Method for Tracking and Scoring Videos Corresponding to Sentences

11/14/2014
by   Haonan Yu, et al.
0

Prior work presented the sentence tracker, a method for scoring how well a sentence describes a video clip or alternatively how well a video clip depicts a sentence. We present an improved method for optimizing the same cost function employed by this prior work, reducing the space complexity from exponential in the sentence length to polynomial, as well as producing a qualitatively identical result in time polynomial in the sentence length instead of exponential. Since this new method is plug-compatible with the prior method, it can be used for the same applications: video retrieval with sentential queries, generating sentential descriptions of video clips, and focusing the attention of a tracker with a sentence, while allowing these applications to scale with significantly larger numbers of object detections, word meanings modeled with HMMs with significantly larger numbers of states, and significantly longer sentences, with no appreciable degradation in quality of results.

READ FULL TEXT
research
08/25/2016

Title Generation for User Generated Videos

A great video title describes the most salient event compactly and captu...
research
09/20/2013

Saying What You're Looking For: Linguistics Meets Video Search

We present an approach to searching large video corpora for video clips ...
research
08/08/2016

Learning Joint Representations of Videos and Sentences with Web Image Search

Our objective is video retrieval based on natural language queries. In a...
research
11/27/2021

An analysis of document graph construction methods for AMR summarization

Meaning Representation (AMR) is a graph-based semantic representation fo...
research
06/21/2013

Discriminative Training: Learning to Describe Video with Sentences, from Video Described with Sentences

We present a method for learning word meanings from complex and realisti...
research
05/07/2017

Generating Memorable Mnemonic Encodings of Numbers

The major system is a mnemonic system that can be used to memorize seque...
research
09/28/2021

CIDEr-R: Robust Consensus-based Image Description Evaluation

This paper shows that CIDEr-D, a traditional evaluation metric for image...

Please sign up or login with your details

Forgot password? Click here to reset