STAR-GNN: Spatial-Temporal Video Representation for Content-based Retrieval

08/15/2022
by   Guoping Zhao, et al.
0

We propose a video feature representation learning framework called STAR-GNN, which applies a pluggable graph neural network component on a multi-scale lattice feature graph. The essence of STAR-GNN is to exploit both the temporal dynamics and spatial contents as well as visual connections between regions at different scales in the frames. It models a video with a lattice feature graph in which the nodes represent regions of different granularity, with weighted edges that represent the spatial and temporal links. The contextual nodes are aggregated simultaneously by graph neural networks with parameters trained with retrieval triplet loss. In the experiments, we show that STAR-GNN effectively implements a dynamic attention mechanism on video frame sequences, resulting in the emphasis for dynamic and semantically rich content in the video, and is robust to noise and redundancies. Empirical results show that STAR-GNN achieves state-of-the-art performance for Content-Based Video Retrieval.

READ FULL TEXT

page 1

page 3

research
10/10/2021

CoRGi: Content-Rich Graph Neural Networks with Attention

Graph representations of a target domain often project it to a set of en...
research
03/01/2022

Motion-aware Dynamic Graph Neural Network for Video Compressive Sensing

Video snapshot compressive imaging (SCI) utilizes a 2D detector to captu...
research
08/04/2020

Boundary Content Graph Neural Network for Temporal Action Proposal Generation

Temporal action proposal generation plays an important role in video act...
research
09/12/2022

Landmark Enhanced Multimodal Graph Learning for Deepfake Video Detection

With the rapid development of face forgery technology, deepfake videos h...
research
05/12/2022

S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for Camera Relocalization

Camera relocalization is the key component of simultaneous localization ...
research
10/05/2020

Neural Generation of Blocks for Video Coding

Well-trained generative neural networks (GNN) are very efficient at comp...
research
04/24/2018

Robust Video Content Alignment and Compensation for Clear Vision Through the Rain

Outdoor vision-based systems suffer from atmospheric turbulences, and ra...

Please sign up or login with your details

Forgot password? Click here to reset