Rotated Object Detection via Scale-invariant Mahalanobis Distance in Aerial Images

04/02/2022
by   Siyang Wen, et al.
15

Rotated object detection in aerial images is a meaningful yet challenging task as objects are densely arranged and have arbitrary orientations. The eight-parameter (coordinates of box vectors) methods in rotated object detection usually use ln-norm losses (L1 loss, L2 loss, and smooth L1 loss) as loss functions. As ln-norm losses are mainly based on non-scale-invariant Minkowski distance, using ln-norm losses will lead to inconsistency with the detection metric rotational Intersection-over-Union (IoU) and training instability. To address the problems, we use Mahalanobis distance to calculate loss between the predicted and the target box vertices' vectors, proposing a new loss function called Mahalanobis Distance Loss (MDL) for eight-parameter rotated object detection. As Mahalanobis distance is scale-invariant, MDL is more consistent with detection metric and more stable during training than ln-norm losses. To alleviate the problem of boundary discontinuity like all other eight-parameter methods, we further take the minimum loss value to make MDL continuous at boundary cases. We achieve state-of-art performance on DOTA-v1.0 with the proposed method MDL. Furthermore, compared to the experiment that uses smooth L1 loss, we find that MDL performs better in rotated object detection.

READ FULL TEXT

page 2

page 5

research
11/19/2019

Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression

Bounding box regression is the crucial step in object detection. In exis...
research
04/19/2019

Automated Focal Loss for Image based Object Detection

Current state-of-the-art object detection algorithms still suffer the pr...
research
08/11/2019

IoU Loss for 2D/3D Object Detection

In 2D/3D object detection task, Intersection-over-Union (IoU) has been w...
research
07/17/2023

Rethinking Intersection Over Union for Small Object Detection in Few-Shot Regime

In Few-Shot Object Detection (FSOD), detecting small objects is extremel...
research
10/26/2021

Alpha-IoU: A Family of Power Intersection over Union Losses for Bounding Box Regression

Bounding box (bbox) regression is a fundamental task in computer vision....
research
09/13/2023

Polygon Intersection-over-Union Loss for Viewpoint-Agnostic Monocular 3D Vehicle Detection

Monocular 3D object detection is a challenging task because depth inform...
research
12/03/2021

A Systematic IoU-Related Method: Beyond Simplified Regression for Better Localization

Four-variable-independent-regression localization losses, such as Smooth...

Please sign up or login with your details

Forgot password? Click here to reset