GiT: Graph Interactive Transformer for Vehicle Re-identification

07/12/2021
by   Fei Shen, et al.
0

Transformers are more and more popular in computer vision, which treat an image as a sequence of patches and learn robust global features from the sequence. However, a suitable vehicle re-identification method should consider both robust global features and discriminative local features. In this paper, we propose a graph interactive transformer (GiT) for vehicle re-identification. On the whole, we stack multiple GiT blocks to build a competitive vehicle re-identification model, in where each GiT block employs a novel local correlation graph (LCG) module to extract discriminative local features within patches and uses a transformer layer to extract robust global features among patches. In detail, in the current GiT block, the LCG module learns local features from local and global features resulting from the LCG module and transformer layer of the previous GiT block. Similarly, the transformer layer learns global features from the global features generated by the transformer layer of the previous GiT block and the new local features outputted via the LCG module of the current GiT block. Therefore, LCG modules and transformer layers are in a coupled status, bringing effective cooperation between local and global features. This is the first work to combine graphs and transformers for vehicle re-identification to the best of our knowledge. Extensive experiments on three large-scale vehicle re-identification datasets demonstrate that our method is superior to state-of-the-art approaches. The code will be available soon.

READ FULL TEXT

page 1

page 3

page 8

research
05/24/2023

InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition

The local and global features are both essential for automatic speech re...
research
01/30/2022

Aggregating Global Features into Local Vision Transformer

Local Transformer-based classification models have recently achieved pro...
research
02/08/2021

TransReID: Transformer-based Object Re-Identification

In this paper, we explore the Vision Transformer (ViT), a pure transform...
research
02/14/2023

Graph-based Village Level Poverty Identification

Poverty status identification is the first obstacle to eradicating pover...
research
09/15/2021

Hybrid Local-Global Transformer for Image Dehazing

Recently, the Vision Transformer (ViT) has shown impressive performance ...
research
01/01/2018

Script Identification in Natural Scene Image and Video Frame using Attention based Convolutional-LSTM Network

Script identification facilitates many important applications in documen...
research
11/10/2022

HSGNet: Object Re-identification with Hierarchical Similarity Graph Network

Object re-identification method is made up of backbone network, feature ...

Please sign up or login with your details

Forgot password? Click here to reset