BAANet: Learning Bi-directional Adaptive Attention Gates for Multispectral Pedestrian Detection

12/04/2021
by   Xiaoxiao Yang, et al.
0

Thermal infrared (TIR) image has proven effectiveness in providing temperature cues to the RGB features for multispectral pedestrian detection. Most existing methods directly inject the TIR modality into the RGB-based framework or simply ensemble the results of two modalities. This, however, could lead to inferior detection performance, as the RGB and TIR features generally have modality-specific noise, which might worsen the features along with the propagation of the network. Therefore, this work proposes an effective and efficient cross-modality fusion module called Bi-directional Adaptive Attention Gate (BAA-Gate). Based on the attention mechanism, the BAA-Gate is devised to distill the informative features and recalibrate the representations asymptotically. Concretely, a bi-direction multi-stage fusion strategy is adopted to progressively optimize features of two modalities and retain their specificity during the propagation. Moreover, an adaptive interaction of BAA-Gate is introduced by the illumination-based weighting strategy to adaptively adjust the recalibrating and aggregating strength in the BAA-Gate and enhance the robustness towards illumination changes. Considerable experiments on the challenging KAIST dataset demonstrate the superior performance of our method with satisfactory speed.

READ FULL TEXT

page 1

page 3

page 4

page 7

research
07/17/2020

Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation

Depth information has proven to be a useful cue in the semantic segmenta...
research
08/23/2023

Cross-Modality Proposal-guided Feature Mining for Unregistered RGB-Thermal Pedestrian Detection

RGB-Thermal (RGB-T) pedestrian detection aims to locate the pedestrians ...
research
08/07/2020

Improving Multispectral Pedestrian Detection by Addressing Modality Imbalance Problems

Multispectral pedestrian detection is capable of adapting to insufficien...
research
08/30/2023

Adaptive Multi-Modalities Fusion in Sequential Recommendation Systems

In sequential recommendation, multi-modal information (e.g., text or ima...
research
03/14/2018

Illumination-aware Faster R-CNN for Robust Multispectral Pedestrian Detection

Multispectral images of color-thermal pairs have shown more effective th...
research
02/01/2023

Multispectral Pedestrian Detection via Reference Box Constrained Cross Attention and Modality Balanced Optimization

Multispectral pedestrian detection is an important task for many around-...
research
01/09/2019

The Cross-Modality Disparity Problem in Multispectral Pedestrian Detection

Aggregating extra features of novel modality brings great advantages for...

Please sign up or login with your details

Forgot password? Click here to reset