Rethinking Transformer-based Set Prediction for Object Detection

11/21/2020
by   Zhiqing Sun, et al.
0

DETR is a recently proposed Transformer-based method which views object detection as a set prediction problem and achieves state-of-the-art performance but demands extra-long training time to converge. In this paper, we investigate the causes of the optimization difficulty in the training of DETR. Our examinations reveal several factors contributing to the slow convergence of DETR, primarily the issues with the Hungarian loss and the Transformer cross attention mechanism. To overcome these issues we propose two solutions, namely, TSP-FCOS (Transformer-based Set Prediction with FCOS) and TSP-RCNN (Transformer-based Set Prediction with RCNN). Experimental results show that the proposed methods not only converge much faster than the original DETR, but also significantly outperform DETR and other baselines in terms of detection accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/12/2022

CenterFormer: Center-based Transformer for 3D Object Detection

Query-based transformer has shown great potential in constructing long-r...
research
01/19/2021

Fast Convergence of DETR with Spatially Modulated Co-Attention

The recently proposed Detection Transformer (DETR) model successfully ap...
research
06/06/2021

Oriented Object Detection with Transformer

Object detection with Transformers (DETR) has achieved a competitive per...
research
08/26/2021

TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-captured Scenarios

Object detection on drone-captured scenarios is a recent popular task. A...
research
10/08/2022

Towards Light Weight Object Detection System

Transformers are a popular choice for classification tasks and as backbo...
research
05/25/2022

AO2-DETR: Arbitrary-Oriented Object Detection Transformer

Arbitrary-oriented object detection (AOOD) is a challenging task to dete...
research
03/23/2018

Speeding-up Object Detection Training for Robotics with FALKON

Latest deep learning methods for object detection provided remarkable pe...

Please sign up or login with your details

Forgot password? Click here to reset