Video-based Person Re-identification with Spatial and Temporal Memory Networks

08/20/2021
by Chanho Eom, et al.

Video-based person re-identification (reID) aims to retrieve person videos with the same identity as a query person across multiple cameras. Spatial and temporal distractors in person videos, such as background clutter and partial occlusions over frames, respectively, make this task much more challenging than image-based person reID. We observe that spatial distractors appear consistently in a particular location, and that temporal distractors show several patterns (e.g., partial occlusions occur in the first few frames); such patterns provide informative cues for predicting which frames to focus on (i.e., temporal attentions). Based on this, we introduce novel Spatial and Temporal Memory Networks (STMN). The spatial memory stores features for spatial distractors that frequently emerge across video frames, while the temporal memory saves attentions that are optimized for typical temporal patterns in person videos. We leverage the spatial memory to refine frame-level person representations and the temporal memory to aggregate the refined frame-level features into a sequence-level person representation, effectively handling spatial and temporal distractors in person videos. We also introduce a memory spread loss that prevents our model from addressing only particular items in the memories. Experimental results on standard benchmarks, including MARS, DukeMTMC-VideoReID, and LS-VID, demonstrate the effectiveness of our method.
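The read, refine, and aggregate flow described above can be illustrated with a minimal, dependency-free sketch. It is not the implementation from the paper; it rests on several assumptions that the abstract does not state: each memory is a key-value store read with a softmax over dot-product similarity, refinement subtracts the retrieved distractor feature from the frame feature, and the temporal query is simply the mean of the refined frame features. All function names and toy values are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def read_memory(query, keys, values):
    """Soft key-value read: softmax over query-key similarity,
    then a weighted sum of the stored values."""
    weights = softmax([dot(query, k) for k in keys])
    dim = len(values[0])
    return [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]


def refine_frame(frame_feat, spatial_keys, spatial_values):
    """Assumption: refinement subtracts the retrieved distractor feature."""
    distractor = read_memory(frame_feat, spatial_keys, spatial_values)
    return [f - d for f, d in zip(frame_feat, distractor)]


def aggregate_sequence(frame_feats, temporal_keys, temporal_values):
    """Retrieve one attention weight per frame from the temporal memory
    and pool the frames into a single sequence-level feature."""
    num_frames = len(frame_feats)
    dim = len(frame_feats[0])
    # Assumption: the temporal query is the mean of the frame features.
    query = [sum(f[d] for f in frame_feats) / num_frames for d in range(dim)]
    raw_attention = read_memory(query, temporal_keys, temporal_values)
    attention = softmax(raw_attention)  # one weight per frame, sums to 1
    return [sum(a * f[d] for a, f in zip(attention, frame_feats)) for d in range(dim)]


# Toy example: 3 frames with 2-dimensional features, 2 items per memory.
frames = [[1.0, 0.5], [0.2, 0.9], [0.4, 0.4]]
spatial_keys = [[1.0, 0.0], [0.0, 1.0]]
spatial_values = [[0.1, 0.0], [0.0, 0.1]]
temporal_keys = [[1.0, 0.0], [0.0, 1.0]]
# Each temporal value is a stored attention pattern over the 3 frames.
temporal_values = [[0.0, 1.0, 1.0], [1.0, 1.0, 0.0]]

refined = [refine_frame(f, spatial_keys, spatial_values) for f in frames]
video_feat = aggregate_sequence(refined, temporal_keys, temporal_values)
```

In this sketch the spatial memory is read once per frame, while the temporal memory is read once per sequence, mirroring the split between frame-level refinement and sequence-level aggregation. The memory spread loss is not modeled here.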


