Ellipse Regression with Predicted Uncertainties for Accurate Multi-View 3D Object Estimation

12/27/2020
by   Wenbo Dong, et al.
0

Convolutional neural network (CNN) based architectures, such as Mask R-CNN, constitute the state of the art in object detection and segmentation. Recently, these methods have been extended for model-based segmentation where the network outputs the parameters of a geometric model (e.g. an ellipse) directly. This work considers objects whose three-dimensional models can be represented as ellipsoids. We present a variant of Mask R-CNN for estimating the parameters of ellipsoidal objects by segmenting each object and accurately regressing the parameters of projection ellipses. We show that model regression is sensitive to the underlying occlusion scenario and that prediction quality for each object needs to be characterized individually for accurate 3D object estimation. We present a novel ellipse regression loss which can learn the offset parameters with their uncertainties and quantify the overall geometric quality of detection for each ellipse. These values, in turn, allow us to fuse multi-view detections to obtain 3D ellipsoid parameters in a principled fashion. The experiments on both synthetic and real datasets quantitatively demonstrate the high accuracy of our proposed method in estimating 3D objects under heavy occlusions compared to previous state-of-the-art methods.

READ FULL TEXT
research
01/30/2020

Ellipse R-CNN: Learning to Infer Elliptical Object from Clustering and Occlusion

Images of heavily occluded objects in cluttered scenes, such as fruit cl...
research
03/21/2018

A Unified Framework for Multi-View Multi-Class Object Pose Estimation

One core challenge in object pose estimation is to ensure accurate and r...
research
05/30/2022

MVMO: A Multi-Object Dataset for Wide Baseline Multi-View Semantic Segmentation

We present MVMO (Multi-View, Multi-Object dataset): a synthetic dataset ...
research
11/05/2018

SPNet: Deep 3D Object Classification and Retrieval using Stereographic Projection

We propose an efficient Stereographic Projection Neural Network (SPNet) ...
research
07/02/2022

ORA3D: Overlap Region Aware Multi-view 3D Object Detection

In multi-view 3D object detection tasks, disparity supervision over over...
research
06/05/2023

Human Spine Motion Capture using Perforated Kinesiology Tape

In this work, we present a marker-based multi-view spine tracking method...
research
07/20/2020

Cephalometric Landmark Regression with Convolutional Neural Networks on 3D Computed Tomography Data

In this paper, we address the problem of automatic three-dimensional cep...

Please sign up or login with your details

Forgot password? Click here to reset