Task Specific Attention is one more thing you need for object detection

02/18/2022
by Sang Yon Lee, et al.

Various models have been proposed to solve the object detection problem. However, most of them require many hand-designed components to perform well. To mitigate this, the Transformer-based DETR and its variant Deformable DETR were proposed. They removed much of the complexity of designing an object detection head, but it has remained unclear whether Transformer-based models can be considered the state of the art in object detection. Furthermore, because DETR applies the Transformer only to the detection head while still using a CNN backbone, it has not been certain that a competent end-to-end pipeline can be built from a combination of attention modules. In this paper, we propose that combining several attention modules with our new Task Specific Split Transformer (TSST) is sufficient to produce the best COCO results without traditional hand-designed components. By splitting the general-purpose attention module into two separate task-specific attention modules, the proposed method offers a way to design simpler object detection models than before. Extensive experiments on the COCO benchmark demonstrate the effectiveness of our approach. Code is released at https://github.com/navervision/tsst

