Spatially and Temporally Efficient Non-local Attention Network for Video-based Person Re-Identification

08/05/2019
by   Chih-Ting Liu, et al.
1

Video-based person re-identification (Re-ID) aims at matching video sequences of pedestrians across non-overlapping cameras. It is a practical yet challenging task of how to embed spatial and temporal information of a video into its feature representation. While most existing methods learn the video characteristics by aggregating image-wise features and designing attention mechanisms in Neural Networks, they only explore the correlation between frames at high-level features. In this work, we target at refining the intermediate features as well as high-level features with non-local attention operations and make two contributions. (i) We propose a Non-local Video Attention Network (NVAN) to incorporate video characteristics into the representation at multiple feature levels. (ii) We further introduce a Spatially and Temporally Efficient Non-local Video Attention Network (STE-NVAN) to reduce the computation complexity by exploring spatial and temporal redundancy presented in pedestrian videos. Extensive experiments show that our NVAN outperforms state-of-the-arts by 3.8 much superior computation footprint compared to existing methods.

READ FULL TEXT

page 1

page 6

research
07/12/2018

Video-based Person Re-identification via 3D Convolutional Networks and Non-local Attention

Video-based person re-identification (ReID) is a challenging problem, wh...
research
05/22/2021

Video-based Person Re-identification without Bells and Whistles

Video-based person re-identification (Re-ID) aims at matching the video ...
research
01/01/2022

Dynamic Scene Video Deblurring using Non-Local Attention

This paper tackles the challenging problem of video deblurring. Most of ...
research
02/07/2020

iqiyi Submission to ActivityNet Challenge 2019 Kinetics-700 challenge: Hierarchical Group-wise Attention

In this report, the method for the iqiyi submission to the task of Activ...
research
07/16/2018

SCAN: Self-and-Collaborative Attention Network for Video Person Re-identification

Video person re-identification attracts much attention in recent years. ...
research
08/14/2020

Not 3D Re-ID: a Simple Single Stream 2D Convolution for Robust Video Re-identification

Video-based person re-identification has received increasing attention r...
research
11/29/2018

Parameter-Free Spatial Attention Network for Person Re-Identification

Global average pooling (GAP) allows to localize discriminative informati...

Please sign up or login with your details

Forgot password? Click here to reset