BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification

04/30/2021
by   Ruibing Hou, et al.
0

In this paper, we present an efficient spatial-temporal representation for video person re-identification (reID). Firstly, we propose a Bilateral Complementary Network (BiCnet) for spatial complementarity modeling. Specifically, BiCnet contains two branches. Detail Branch processes frames at original resolution to preserve the detailed visual clues, and Context Branch with a down-sampling strategy is employed to capture long-range contexts. On each branch, BiCnet appends multiple parallel and diverse attention modules to discover divergent body parts for consecutive frames, so as to obtain an integral characteristic of target identity. Furthermore, a Temporal Kernel Selection (TKS) block is designed to capture short-term as well as long-term temporal relations by an adaptive mode. TKS can be inserted into BiCnet at any depth to construct BiCnetTKS for spatial-temporal modeling. Experimental results on multiple benchmarks show that BiCnet-TKS outperforms state-of-the-arts with about 50 available at https://github.com/ blue-blue272/BiCnet-TKS.

READ FULL TEXT

page 1

page 2

research
07/18/2020

Temporal Complementary Learning for Video Person Re-Identification

This paper proposes a Temporal Complementary Learning Network that extra...
research
04/10/2020

Co-Saliency Spatio-Temporal Interaction Network for Person Re-Identification in Videos

Person re-identification aims at identifying a certain pedestrian across...
research
09/02/2020

IAUnet: Global Context-Aware Feature Learning for Person Re-Identification

Person re-identification (reID) by CNNs based networks has achieved favo...
research
05/22/2021

Video-based Person Re-identification without Bells and Whistles

Video-based person re-identification (Re-ID) aims at matching the video ...
research
09/04/2022

Spatial-Temporal Transformer for Video Snapshot Compressive Imaging

Video snapshot compressive imaging (SCI) captures multiple sequential vi...
research
08/07/2023

Video-based Person Re-identification with Long Short-Term Representation Learning

Video-based person Re-Identification (V-ReID) aims to retrieve specific ...
research
01/26/2023

Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring

Image-text pretrained models, e.g., CLIP, have shown impressive general ...

Please sign up or login with your details

Forgot password? Click here to reset