RHINO: Rotated DETR with Dynamic Denoising via Hungarian Matching for Oriented Object Detection

05/12/2023
by   Hakjin Lee, et al.
0

With the publication of DINO, a variant of the Detection Transformer (DETR), Detection Transformers are breaking the record in the object detection benchmark with the merits of their end-to-end design and scalability. However, the extension of DETR to oriented object detection has not been thoroughly studied although more benefits from its end-to-end architecture are expected such as removing NMS and anchor-related costs. In this paper, we propose a first strong DINO-based baseline for oriented object detection. We found that straightforward employment of DETRs for oriented object detection does not guarantee non-duplicate prediction, and propose a simple cost to mitigate this. Furthermore, we introduce a dynamic denoising strategy that uses Hungarian matching to filter redundant noised queries and query alignment to preserve matching consistency between Transformer decoder layers. Our proposed model outperforms previous rotated DETRs and other counterparts, achieving state-of-the-art performance in DOTA-v1.0/v1.5/v2.0, and DIOR-R benchmarks.

READ FULL TEXT

page 2

page 7

research
06/06/2021

Oriented Object Detection with Transformer

Object detection with Transformers (DETR) has achieved a competitive per...
research
03/01/2023

D2Q-DETR: Decoupling and Dynamic Queries for Oriented Object Detection with Transformers

Despite the promising results, existing oriented object detection method...
research
06/14/2022

Efficient Decoder-free Object Detection with Transformers

Vision transformers (ViTs) are changing the landscape of object detectio...
research
05/25/2022

AO2-DETR: Arbitrary-Oriented Object Detection Transformer

Arbitrary-oriented object detection (AOOD) is a challenging task to dete...
research
06/02/2022

SparseDet: Towards End-to-End 3D Object Detection

In this paper, we propose SparseDet for end-to-end 3D object detection f...
research
09/16/2021

An End-to-End Transformer Model for 3D Object Detection

We propose 3DETR, an end-to-end Transformer based object detection model...
research
04/11/2022

Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection

Human-Object Interaction detection is a holistic visual recognition task...

Please sign up or login with your details

Forgot password? Click here to reset