Video Re-localization

08/05/2018
by   Yang Feng, et al.
4

Many methods have been developed to help people find the video contents they want efficiently. However, there are still some unsolved problems in this area. For example, given a query video and a reference video, how to accurately localize a segment in the reference video such that the segment semantically corresponds to the query video? We define a distinctively new task, namely video re-localization, to address this scenario. Video re-localization is an important emerging technology implicating many applications, such as fast seeking in videos, video copy detection, video surveillance, etc. Meanwhile, it is also a challenging research task because the visual appearance of a semantic concept in videos can have large variations. The first hurdle to clear for the video re-localization task is the lack of existing datasets. It is labor expensive to collect pairs of videos with semantic coherence or correspondence and label the corresponding segments. We first exploit and reorganize the videos in ActivityNet to form a new dataset for video re-localization research, which consists of about 10,000 videos of diverse visual appearances associated with localized boundary information. Subsequently, we propose an innovative cross gated bilinear matching model such that every time-step in the reference video is matched against the attentively weighted query video. Consequently, the prediction of the starting and ending time is formulated as a classification problem based on the matching results. Extensive experimental results show that the proposed method outperforms the competing methods. Our code is available at: https://github.com/fengyang0317/video_reloc.

READ FULL TEXT

page 2

page 13

research
05/10/2019

Spatio-temporal Video Re-localization by Warp LSTM

The need for efficiently finding the video content a user wants is incre...
research
10/31/2019

Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos

Temporal sentence grounding in videos aims to detect and localize one ta...
research
06/15/2023

The 2023 Video Similarity Dataset and Challenge

This work introduces a dataset, benchmark, and challenge for the problem...
research
10/15/2022

Semantic Video Moments Retrieval at Scale: A New Task and a Baseline

Motivated by the increasing need of saving search effort by obtaining re...
research
08/12/2019

Sentence Specified Dynamic Video Thumbnail Generation

With the tremendous growth of videos over the Internet, video thumbnails...
research
05/04/2021

A Fast Partial Video Copy Detection Using KNN and Global Feature Database

We propose a fast partial video copy detection framework in this paper. ...
research
08/22/2023

Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition

We are concerned with a challenging scenario in unpaired multiview video...

Please sign up or login with your details

Forgot password? Click here to reset