GrooMeD-NMS: Grouped Mathematically Differentiable NMS for Monocular 3D Object Detection

03/31/2021
by   Abhinav Kumar, et al.
13

Modern 3D object detectors have immensely benefited from the end-to-end learning idea. However, most of them use a post-processing algorithm called Non-Maximal Suppression (NMS) only during inference. While there were attempts to include NMS in the training pipeline for tasks such as 2D object detection, they have been less widely adopted due to a non-mathematical expression of the NMS. In this paper, we present and integrate GrooMeD-NMS – a novel Grouped Mathematically Differentiable NMS for monocular 3D object detection, such that the network is trained end-to-end with a loss on the boxes after NMS. We first formulate NMS as a matrix operation and then group and mask the boxes in an unsupervised manner to obtain a simple closed-form expression of the NMS. GrooMeD-NMS addresses the mismatch between training and inference pipelines and, therefore, forces the network to select the best 3D box in a differentiable manner. As a result, GrooMeD-NMS achieves state-of-the-art monocular 3D object detection results on the KITTI benchmark dataset performing comparably to monocular video-based methods. Code and models at https://github.com/abhi1kumar/groomed_nms

READ FULL TEXT
research
05/14/2019

Monocular 3D Object Detection via Geometric Reasoning on Keypoints

Monocular 3D object detection is well-known to be a challenging vision t...
research
05/23/2019

Shift R-CNN: Deep Monocular 3D Object Detection with Closed-Form Geometric Constraints

We propose Shift R-CNN, a hybrid model for monocular 3D object detection...
research
03/01/2021

Categorical Depth Distribution Network for Monocular 3D Object Detection

Monocular 3D object detection is a key problem for autonomous vehicles, ...
research
07/21/2022

DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection

Modern neural networks use building blocks such as convolutions that are...
research
05/08/2017

Learning non-maximum suppression

Object detectors have hugely profited from moving towards an end-to-end ...
research
04/02/2022

Homography Loss for Monocular 3D Object Detection

Monocular 3D object detection is an essential task in autonomous driving...
research
12/06/2018

ROI-10D: Monocular Lifting of 2D Detection to 6D Pose and Metric Shape

We present a deep learning method for end-to-end monocular 3D object det...

Please sign up or login with your details

Forgot password? Click here to reset