D2Q-DETR: Decoupling and Dynamic Queries for Oriented Object Detection with Transformers

by Qiang Zhou, et al.

Despite promising results, existing oriented object detection methods usually rely on heuristically designed rules, e.g., RRoI generation and rotated NMS. In this paper, we propose an end-to-end framework for oriented object detection that simplifies the model pipeline and obtains superior performance. Our framework is based on DETR, with the box regression head replaced by a points prediction head. Learning points is more flexible, and the distribution of the points can reflect the angle and size of the target rotated box. We further propose to decouple the query features into classification and regression features, which significantly improves model precision. Aerial images usually contain thousands of instances. To better balance model precision and efficiency, we propose a novel dynamic query design, which reduces the number of object queries in stacked decoder layers without sacrificing model performance. Finally, we rethink the label assignment strategy of existing DETR-like detectors and propose an effective label re-assignment strategy for improved performance. We name our method D2Q-DETR. Experiments on the large-scale and challenging DOTA-v1.0 and DOTA-v1.5 datasets show that D2Q-DETR outperforms existing NMS-based and NMS-free oriented object detection methods and achieves a new state of the art.
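To illustrate how the distribution of a point set can reflect the angle and size of a rotated box, the following is a minimal sketch that fits an oriented box to 2D points using a closed-form 2D principal component analysis. This is an assumption made for illustration only: the abstract does not state how D2Q-DETR converts predicted points into a box, and the function name `points_to_rotated_box` and the `(cx, cy, w, h, angle)` output convention are hypothetical.

```python
import math


def points_to_rotated_box(points):
    """Fit an oriented box (cx, cy, w, h, angle) to a list of (x, y) points.

    Illustrative assumption, not the paper's method: the box orientation is
    the principal axis of the points, and the box size is the extent of the
    points along that axis and the axis perpendicular to it.
    The angle is in radians, in the range (-pi/2, pi/2].
    """
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points")

    # Mean of the point set.
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n

    # Entries of the 2x2 covariance matrix.
    sxx = sum((x - mx) ** 2 for x, _ in points) / n
    syy = sum((y - my) ** 2 for _, y in points) / n
    sxy = sum((x - mx) * (y - my) for x, y in points) / n

    # Closed-form orientation of the principal axis of a 2x2 covariance.
    angle = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    cos_a, sin_a = math.cos(angle), math.sin(angle)

    # Project the centered points onto the principal axis (u) and its normal (v).
    us = [(x - mx) * cos_a + (y - my) * sin_a for x, y in points]
    vs = [-(x - mx) * sin_a + (y - my) * cos_a for x, y in points]

    # Box size is the extent of the projections.
    w = max(us) - min(us)
    h = max(vs) - min(vs)

    # Box center is the midpoint of the extents, mapped back to image axes.
    uc = 0.5 * (max(us) + min(us))
    vc = 0.5 * (max(vs) + min(vs))
    cx = mx + uc * cos_a - vc * sin_a
    cy = my + uc * sin_a + vc * cos_a
    return cx, cy, w, h, angle


if __name__ == "__main__":
    # Corners of a 4 x 2 rectangle centered at (10, 5), rotated by 30 degrees.
    theta = math.radians(30.0)
    corners = [(2, 1), (2, -1), (-2, -1), (-2, 1)]
    pts = [
        (10 + u * math.cos(theta) - v * math.sin(theta),
         5 + u * math.sin(theta) + v * math.cos(theta))
        for u, v in corners
    ]
    print(points_to_rotated_box(pts))
```

The principal-axis fit is only one possible conversion. A minimum-area enclosing rectangle would be another reasonable choice, and the two can differ when the points are spread unevenly.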




