Feature Disentanglement Learning with Switching and Aggregation for Video-based Person Re-Identification

12/16/2022
by   Minjung Kim, et al.
0

In video person re-identification (Re-ID), the network must consistently extract features of the target person from successive frames. Existing methods tend to focus only on how to use temporal information, which often leads to networks being fooled by similar appearances and same backgrounds. In this paper, we propose a Disentanglement and Switching and Aggregation Network (DSANet), which segregates the features representing identity and features based on camera characteristics, and pays more attention to ID information. We also introduce an auxiliary task that utilizes a new pair of features created through switching and aggregation to increase the network's capability for various camera scenarios. Furthermore, we devise a Target Localization Module (TLM) that extracts robust features against a change in the position of the target according to the frame flow and a Frame Weight Generation (FWG) that reflects temporal information in the final representation. Various loss functions for disentanglement learning are designed so that each component of the network can cooperate while satisfactorily performing its own role. Quantitative and qualitative results from extensive experiments demonstrate the superiority of DSANet over state-of-the-art methods on three benchmark datasets.

READ FULL TEXT

page 1

page 3

page 4

page 6

page 7

page 8

research
12/26/2018

Spatial and Temporal Mutual Promotion for Video-based Person Re-identification

Video-based person re-identification is a crucial task of matching video...
research
02/22/2018

Video Person Re-identification by Temporal Residual Learning

In this paper, we propose a novel feature learning framework for video p...
research
08/11/2019

Temporal Knowledge Propagation for Image-to-Video Person Re-identification

In many scenarios of Person Re-identification (Re-ID), the gallery set c...
research
05/05/2019

Intra-clip Aggregation for Video Person Re-identification

Video-based person re-id has drawn much attention in recent years due to...
research
03/12/2019

Learning Feature Aggregation in Temporal Domain for Re-Identification

Person re-identification is a standard and established problem in the co...
research
10/22/2021

Local-Global Associative Frame Assemble in Video Re-ID

Noisy and unrepresentative frames in automatically generated object boun...
research
07/06/2022

Context Sensing Attention Network for Video-based Person Re-identification

Video-based person re-identification (ReID) is challenging due to the pr...

Please sign up or login with your details

Forgot password? Click here to reset