Directly Optimizing IoU for Bounding Box Localization

04/14/2023
by Mofassir ul Islam Arif, et al.

Object detection has seen remarkable progress in recent years with the introduction of Convolutional Neural Networks (CNNs). It is a multi-task learning problem in which both the positions of the objects in an image and their classes need to be correctly identified. For localization, the goal is to maximize the overlap between the ground-truth bounding boxes and the predictions, i.e. the Intersection over Union (IoU). In current work in this domain, IoU is approximated by using the Huber loss as a proxy, but this indirect method does not leverage the IoU information and treats the bounding box as four independent, unrelated regression terms. This assumption does not hold for a bounding box, whose four coordinates are highly correlated and carry semantic meaning when taken together. Direct optimization of the IoU is not possible due to its non-convex and non-differentiable nature. In this paper, we formulate a novel loss, the Smooth IoU, which directly optimizes the IoU of the bounding boxes. This loss has been evaluated on the Oxford IIIT Pets, Udacity self-driving car, PASCAL VOC, and VWFS Car Damage datasets and has shown performance gains over the standard Huber loss.
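The gap between the Huber proxy and the IoU can be illustrated with a small sketch. The code below is not the Smooth IoU loss from the paper, whose formulation is not given in this abstract; it only computes the plain IoU and a per-coordinate Huber loss for two hypothetical predictions. The `(x_min, y_min, x_max, y_max)` box format, the function names, and the example boxes are all assumptions made for illustration.

```python
def iou(box_a, box_b):
    """IoU of two axis-aligned boxes given as (x_min, y_min, x_max, y_max)."""
    # Corners of the intersection rectangle.
    ix_min = max(box_a[0], box_b[0])
    iy_min = max(box_a[1], box_b[1])
    ix_max = min(box_a[2], box_b[2])
    iy_max = min(box_a[3], box_b[3])
    # Clamp at zero so that non-overlapping boxes give an empty intersection.
    inter = max(0.0, ix_max - ix_min) * max(0.0, iy_max - iy_min)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def huber(pred, target, delta=1.0):
    """Mean Huber loss over the four coordinates, each treated independently."""
    total = 0.0
    for p, t in zip(pred, target):
        err = abs(p - t)
        if err <= delta:
            total += 0.5 * err * err          # quadratic region
        else:
            total += delta * (err - 0.5 * delta)  # linear region
    return total / len(pred)


# Hypothetical ground truth and two predictions, each off by 2 in every coordinate.
gt = (0.0, 0.0, 10.0, 10.0)
shifted = (2.0, 2.0, 12.0, 12.0)  # same size, translated
shrunk = (2.0, 2.0, 8.0, 8.0)     # same centre, smaller

for name, pred in [("shifted", shifted), ("shrunk", shrunk)]:
    print(f"{name}: huber={huber(pred, gt):.3f}  iou={iou(pred, gt):.3f}")
```

Both predictions have the same absolute error in every coordinate, so the Huber loss cannot tell them apart, yet their overlap with the ground truth differs. This is the sense in which a per-coordinate regression loss ignores the relationship between the four coordinates.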


