VVS: Video-to-Video Retrieval with Irrelevant Frame Suppression

03/15/2023
by   Won Jo, et al.

In content-based video retrieval (CBVR), which deals with large-scale collections, efficiency is as important as accuracy. For this reason, several video-level feature-based studies have been actively conducted; nevertheless, owing to the severe difficulty of embedding a lengthy and untrimmed video into a single feature, these approaches have proven insufficient for accurate retrieval compared to frame-level feature-based studies. In this paper, we show that appropriate suppression of irrelevant frames can be a clue to overcoming the current obstacles of video-level feature-based approaches. Furthermore, we propose a Video-to-Video Suppression network (VVS) as a solution. VVS is an end-to-end framework that consists of an easy distractor elimination stage, which identifies which frames to remove, and a suppression weight generation stage, which determines how much to suppress the remaining frames. This structure is intended to effectively describe an untrimmed video with varying content and meaningless information. Its efficacy is demonstrated via extensive experiments, and we show that our approach is not only state-of-the-art among video-level feature-based approaches but also has a fast inference time while possessing retrieval capabilities close to those of frame-level feature-based approaches.
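The two-stage idea in the abstract can be illustrated with a minimal sketch. This is not the VVS network itself: in VVS the elimination decisions and suppression weights are produced by learned stages, whereas here they are passed in as plain numbers. The function names, the `threshold` parameter, and the weighted-sum pooling are all assumptions made for illustration only.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit L2 norm (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def video_level_feature(frames, distractor_scores, weights, threshold=0.5):
    """Aggregate frame features into a single video-level feature.

    frames: list of T frame feature vectors, each of length D
    distractor_scores: T hypothetical scores; higher means "more likely a distractor"
    weights: T hypothetical suppression weights in [0, 1]
    threshold: hypothetical cut-off for the elimination step
    """
    # Step 1 (elimination): drop frames whose distractor score exceeds the threshold.
    kept = [
        (f, w)
        for f, s, w in zip(frames, distractor_scores, weights)
        if s <= threshold
    ]
    if not kept:
        raise ValueError("all frames were eliminated")

    # Step 2 (suppression): weighted sum of the remaining frames (assumed pooling).
    dim = len(kept[0][0])
    pooled = [sum(w * f[d] for f, w in kept) for d in range(dim)]
    return l2_normalize(pooled)


def cosine_similarity(a, b):
    """Dot product, which equals cosine similarity for unit-norm vectors."""
    return sum(x * y for x, y in zip(a, b))


# Toy example: the second frame is treated as a distractor and removed,
# and the third frame is down-weighted by half.
frames = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]]
feature = video_level_feature(frames, [0.1, 0.9, 0.2], [1.0, 1.0, 0.5])
```

Because each video is reduced to one vector, comparing a query against a collection needs only one `cosine_similarity` call per candidate, which is the efficiency argument made for video-level features over frame-level ones.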


