3D PersonVLAD: Learning Deep Global Representations for Video-based Person Re-identification

12/26/2018
by   Lin Wu, et al.
12

In this paper, we introduce a global video representation to video-based person re-identification (re-ID) that aggregates local 3D features across the entire video extent. Most of the existing methods rely on 2D convolutional networks (ConvNets) to extract frame-wise deep features which are pooled temporally to generate the video-level representations. However, 2D ConvNets lose temporal input information immediately after the convolution, and a separate temporal pooling is limited in capturing human motion in shorter sequences. To this end, we present a global video representation (3D PersonVLAD), complementary to 3D ConvNets as a novel layer to capture the appearance and motion dynamics in full-length videos. However, encoding each video frame in its entirety and computing an aggregate global representation across all frames is tremendously challenging due to occlusions and misalignments. To resolve this, our proposed network is further augmented with 3D part alignment module to learn local features through soft-attention module. These attended features are statistically aggregated to yield identity-discriminative representations. Our global 3D features are demonstrated to achieve state-of-the-art results on three benchmark datasets: MARS MARS, iLIDS-VID VideoRanking, and PRID 2011

READ FULL TEXT

page 2

page 3

page 5

page 6

page 9

page 10

page 11

page 12

research
10/26/2018

Video-based Person Re-identification Using Spatial-Temporal Attention Networks

We consider the problem of video-based person re-identification. The goa...
research
07/25/2021

Spatio-Temporal Representation Factorization for Video-based Person Re-Identification

Despite much recent progress in video-based person re-identification (re...
research
10/22/2021

Local-Global Associative Frame Assemble in Video Re-ID

Noisy and unrepresentative frames in automatically generated object boun...
research
03/21/2020

Video-based Person Re-Identification using Gated Convolutional Recurrent Neural Networks

Deep neural networks have been successfully applied to solving the video...
research
11/28/2019

Rethinking Temporal Fusion for Video-based Person Re-identification on Semantic and Time Aspect

Recently, the research interest of person re-identification (ReID) has g...
research
06/10/2021

Unsupervised Video Person Re-identification via Noise and Hard frame Aware Clustering

Unsupervised video-based person re-identification (re-ID) methods extrac...
research
10/22/2021

Wide and Narrow: Video Prediction from Context and Motion

Video prediction, forecasting the future frames from a sequence of input...

Please sign up or login with your details

Forgot password? Click here to reset