Efficient Multi-Scale Attention Module with Cross-Spatial Learning

05/23/2023
by Daliang Ouyang, et al.

Channel and spatial attention mechanisms have proven remarkably effective at producing more discernible feature representations across a variety of computer vision tasks. However, modeling cross-channel relationships with channel dimensionality reduction may have side effects on the extraction of deep visual representations. In this paper, a novel efficient multi-scale attention (EMA) module is proposed. To retain per-channel information while reducing computational overhead, we reshape part of the channels into the batch dimension and group the channel dimension into multiple sub-features, so that spatial semantic features are well distributed within each feature group. Specifically, in addition to encoding global information to recalibrate the channel-wise weights in each parallel branch, the output features of the two parallel branches are further aggregated by a cross-dimension interaction that captures pixel-level pairwise relationships. We conduct extensive ablation studies and experiments on image classification and object detection tasks using popular benchmarks (e.g., CIFAR-100, ImageNet-1k, MS COCO, and VisDrone2019) to evaluate its performance.
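
The channel grouping and the cross-branch aggregation mentioned in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function names (`group_into_batch`, `cross_spatial_weights`, `ema_like`), the use of global average pooling followed by a softmax as the channel descriptor, and the final sigmoid gating are all assumptions made for illustration, and the two parallel branches are passed in as arbitrary callables because the abstract does not specify them.

```python
import numpy as np


def group_into_batch(x, groups):
    """Fold channel groups into the batch axis: (B, C, H, W) -> (B*G, C//G, H, W)."""
    b, c, h, w = x.shape
    assert c % groups == 0, "channels must be divisible by groups"
    return x.reshape(b * groups, c // groups, h, w)


def softmax(z, axis):
    """Numerically stable softmax along one axis."""
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)


def cross_spatial_weights(branch_a, branch_b):
    """Match a global channel descriptor of branch_a against every pixel of branch_b.

    Assumption: the descriptor is global average pooling followed by a
    softmax over channels. Returns an array of shape (N, 1, H*W).
    """
    n, c, h, w = branch_a.shape
    pooled = branch_a.mean(axis=(2, 3))            # (N, C)
    descriptor = softmax(pooled, axis=1)[:, None]  # (N, 1, C)
    pixels = branch_b.reshape(n, c, h * w)         # (N, C, H*W)
    return descriptor @ pixels                     # (N, 1, H*W)


def ema_like(x, branch_a_fn, branch_b_fn, groups):
    """Hypothetical EMA-style gating built from two user-supplied branches."""
    b, c, h, w = x.shape
    grouped = group_into_batch(x, groups)
    out_a = branch_a_fn(grouped)
    out_b = branch_b_fn(grouped)
    # Aggregate the two branches in both directions, then squash to (0, 1).
    attn = cross_spatial_weights(out_a, out_b) + cross_spatial_weights(out_b, out_a)
    attn = 1.0 / (1.0 + np.exp(-attn))
    attn = attn.reshape(b * groups, 1, h, w)
    # Gate each group's features and restore the original layout.
    return (grouped * attn).reshape(b, c, h, w)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.standard_normal((2, 8, 4, 4))
    y = ema_like(x, lambda t: t, lambda t: 0.5 * t, groups=4)
    print(y.shape)
```

Because the groups are folded into the batch axis, no channel is discarded or compressed: every group keeps its full set of channels, and the spatial attention map is computed per group at the cost of a single matrix product per direction.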

research
01/30/2021

SA-Net: Shuffle Attention for Deep Convolutional Neural Networks

Attention mechanisms, which enable a neural network to accurately focus ...
research
10/06/2020

Rotate to Attend: Convolutional Triplet Attention Module

Benefiting from the capability of building inter-dependencies among chan...
research
12/10/2021

Global Attention Mechanism: Retain Information to Enhance Channel-Spatial Interactions

A variety of attention mechanisms have been studied to improve the perfo...
research
03/19/2021

CE-FPN: Enhancing Channel Information for Object Detection

Feature pyramid network (FPN) has been an effective framework to extract...
research
05/23/2019

Spatial Group-wise Enhance: Improving Semantic Feature Learning in Convolutional Networks

The Convolutional Neural Networks (CNNs) generate the feature representa...
research
08/25/2023

Squeeze aggregated excitation network

Convolutional neural networks have spatial representations which read pa...
research
12/13/2022

CAT: Learning to Collaborate Channel and Spatial Attention from Multi-Information Fusion

Channel and spatial attention mechanism has proven to provide an evident...
