CMTR: Cross-modality Transformer for Visible-infrared Person Re-identification

10/18/2021
by   Tengfei Liang, et al.

Visible-infrared cross-modality person re-identification is a challenging ReID task that aims to retrieve and match images of the same identity across the heterogeneous visible and infrared modalities. The core of the task is therefore to bridge the large gap between these two modalities. Existing convolutional neural network-based methods perceive modality information insufficiently and cannot learn well-discriminating, modality-invariant embeddings for identities, which limits their performance. To solve these problems, we propose a cross-modality transformer-based method (CMTR) for the visible-infrared person re-identification task, which explicitly mines the information of each modality and generates more discriminative features from it. Specifically, to capture the characteristics of each modality, we design novel modality embeddings, which are fused with the token embeddings to encode modality information. Furthermore, to enhance the representation of the modality embeddings and adjust the distribution of the matching embeddings, we propose a modality-aware enhancement loss based on the learned modality information, which reduces intra-class distance and enlarges inter-class distance. To our knowledge, this is the first work to apply a transformer network to the cross-modality re-identification task. We conduct extensive experiments on the public SYSU-MM01 and RegDB datasets, and the proposed CMTR model significantly outperforms existing state-of-the-art CNN-based methods.
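
The idea of fusing modality embeddings with token embeddings can be illustrated with a minimal sketch. The abstract does not say how the fusion is done, so the code below assumes element-wise addition, in the same way position embeddings are commonly added in vision transformers. The function name `fuse_embeddings`, the table layouts, and the toy values are all hypothetical and are not taken from the paper.

```python
from typing import List

Vector = List[float]

# Hypothetical modality indices; the paper's actual encoding is not given here.
VISIBLE, INFRARED = 0, 1


def fuse_embeddings(
    patch_tokens: List[Vector],
    position_table: List[Vector],
    modality_table: List[Vector],
    modality_id: int,
) -> List[Vector]:
    """Add a position embedding and a modality embedding to every token.

    Assumption: fusion is element-wise addition. The same modality vector
    is shared by all tokens of an image from that modality.
    """
    modality_vec = modality_table[modality_id]
    fused = []
    for idx, token in enumerate(patch_tokens):
        pos_vec = position_table[idx]
        fused.append([t + p + m for t, p, m in zip(token, pos_vec, modality_vec)])
    return fused


# Toy example: two tokens of dimension 2 (real models use far larger sizes).
modality_table = [[0.5, 0.5], [-0.5, -0.5]]  # one (learnable) row per modality
position_table = [[0.0, 0.1], [0.2, 0.3]]    # one (learnable) row per token position
tokens = [[1.0, 2.0], [3.0, 4.0]]

visible_input = fuse_embeddings(tokens, position_table, modality_table, VISIBLE)
infrared_input = fuse_embeddings(tokens, position_table, modality_table, INFRARED)
```

In this sketch, the same patch content yields different transformer inputs depending on the modality, which is one plausible way for the network to stay aware of which modality it is processing. In a trained model the rows of both tables would be learned parameters rather than fixed constants.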


