
Decoupled Self Attention for Accurate One Stage Object Detection

by Kehe WU, et al.

Because object detection datasets are smaller than the image recognition dataset ImageNet, transfer learning has become a basic training method for deep learning object detection models: the backbone network of the detection model is pretrained on ImageNet to extract features for the classification and localization subtasks. However, the classification task focuses on the salient region features of an object, while the localization task focuses on its edge features, so there is a certain deviation between the features extracted by the pretrained backbone network and the features needed for the localization task. To solve this problem, this paper proposes a decoupled self-attention (DSA) module for one-stage object detection models. DSA includes two decoupled self-attention branches, so it can extract appropriate features for each task. It is located between the FPN and the head networks of the subtasks, where it extracts global features from the FPN fused features for each task independently. Although the network of the DSA module is simple, it effectively improves object detection performance and can be easily embedded in many detection models. Our experiments are based on the representative one-stage detection model RetinaNet. On the COCO dataset, when ResNet50 and ResNet101 are used as backbone networks, the detection performance can be increased by 0.4 respectively. When the DSA module and the object confidence task are applied in RetinaNet together, the detection performance based on ResNet50 and ResNet101 can be increased by 1.0. These results show the effectiveness of the DSA module. Code is at:
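
To make the placement described in the abstract concrete, here is a minimal NumPy sketch, not the authors' released code, of two independent self-attention branches applied to a single FPN feature map, one feeding the classification head and one feeding the localization head. The abstract does not specify the attention formulation, so the scaled dot-product form, the residual connection, the random weights, and every name here (`SelfAttentionBranch`, `DecoupledSelfAttention`, `cls_branch`, `loc_branch`) are assumptions chosen for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


class SelfAttentionBranch:
    """One self-attention branch over a (C, H, W) feature map (assumed form)."""

    def __init__(self, channels, rng):
        scale = 1.0 / np.sqrt(channels)
        # Randomly initialised projections stand in for learned weights.
        self.w_q = rng.normal(0.0, scale, (channels, channels))
        self.w_k = rng.normal(0.0, scale, (channels, channels))
        self.w_v = rng.normal(0.0, scale, (channels, channels))

    def __call__(self, feat):
        c, h, w = feat.shape
        x = feat.reshape(c, h * w).T                 # (N, C), N = H * W positions
        q, k, v = x @ self.w_q, x @ self.w_k, x @ self.w_v
        attn = softmax(q @ k.T / np.sqrt(c))         # (N, N): every position attends to all others
        out = x + attn @ v                           # residual connection (an assumption)
        return out.T.reshape(c, h, w)


class DecoupledSelfAttention:
    """Two branches with separate weights: one per detection subtask."""

    def __init__(self, channels, seed=0):
        rng = np.random.default_rng(seed)
        self.cls_branch = SelfAttentionBranch(channels, rng)
        self.loc_branch = SelfAttentionBranch(channels, rng)

    def __call__(self, fpn_feat):
        # The same FPN feature goes through both branches independently.
        return self.cls_branch(fpn_feat), self.loc_branch(fpn_feat)


# Hypothetical FPN level: 8 channels on a 4 x 4 grid.
fpn_level = np.random.default_rng(1).normal(size=(8, 4, 4))
dsa = DecoupledSelfAttention(channels=8)
cls_feat, loc_feat = dsa(fpn_level)
print(cls_feat.shape, loc_feat.shape)
```

Because the two branches hold separate weights, `cls_feat` and `loc_feat` differ even though they are computed from the same input, which is the sense in which the features for the two subtasks are decoupled in this sketch.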




YOLOSA: Object detection based on 2D local feature superimposed self-attention

We analyzed the network structure of real-time object detection models a...

CNN-transformer mixed model for object detection

Object detection, one of the three main tasks of computer vision, has be...

Concealed Object Detection for Passive Millimeter-Wave Security Imaging Based on Task-Aligned Detection Transformer

Passive millimeter-wave (PMMW) is a significant potential technique for ...

Excavating RoI Attention for Underwater Object Detection

Self-attention is one of the most successful designs in deep learning, w...

Dynamic Head: Unifying Object Detection Heads with Attentions

The complex nature of combining localization and classification in objec...

GiraffeDet: A Heavy-Neck Paradigm for Object Detection

In conventional object detection frameworks, a backbone body inherited f...

Analysis of Visual Reasoning on One-Stage Object Detection

Current state-of-the-art one-stage object detectors are limited by treat...