Jointly Attentive Spatial-Temporal Pooling Networks for Video-based Person Re-Identification

08/03/2017
by   Shuangjie Xu, et al.
0

Person Re-Identification (person re-id) is a crucial task as its applications in visual surveillance and human-computer interaction. In this work, we present a novel joint Spatial and Temporal Attention Pooling Network (ASTPN) for video-based person re-identification, which enables the feature extractor to be aware of the current input video sequences, in a way that interdependency from the matching items can directly influence the computation of each other's representation. Specifically, the spatial pooling layer is able to select regions from each frame, while the attention temporal pooling performed can select informative frames over the sequence, both pooling guided by the information from distance matching. Experiments are conduced on the iLIDS-VID, PRID-2011 and MARS datasets and the results demonstrate that this approach outperforms existing state-of-art methods. We also analyze how the joint pooling in both dimensions can boost the person re-id performance more effectively than using either of them separately.

READ FULL TEXT
research
12/26/2018

Spatial and Temporal Mutual Promotion for Video-based Person Re-identification

Video-based person re-identification is a crucial task of matching video...
research
03/16/2021

Dense Interaction Learning for Video-based Person Re-identification

Video-based person re-identification (re-ID) aims at matching the same p...
research
07/03/2018

A Spatial and Temporal Features Mixture Model with Body Parts for Video-based Person Re-Identification

The video-based person re-identification is to recognize a person under ...
research
06/19/2020

A Symbolic Temporal Pooling method for Video-based Person Re-Identification

In video-based person re-identification, both the spatial and temporal f...
research
08/03/2019

ABD-Net: Attentive but Diverse Person Re-Identification

Attention mechanism has been shown to be effective for person re-identif...
research
04/27/2023

Deeply-Coupled Convolution-Transformer with Spatial-temporal Complementary Learning for Video-based Person Re-identification

Advanced deep Convolutional Neural Networks (CNNs) have shown great succ...
research
08/14/2020

Not 3D Re-ID: a Simple Single Stream 2D Convolution for Robust Video Re-identification

Video-based person re-identification has received increasing attention r...

Please sign up or login with your details

Forgot password? Click here to reset