Scatter Points in Space: 3D Detection from Multi-view Monocular Images

08/31/2022
by   Jianlin Liu, et al.
30

3D object detection from monocular image(s) is a challenging and long-standing problem of computer vision. To combine information from different perspectives without troublesome 2D instance tracking, recent methods tend to aggregate multiview feature by sampling regular 3D grid densely in space, which is inefficient. In this paper, we attempt to improve multi-view feature aggregation by proposing a learnable keypoints sampling method, which scatters pseudo surface points in 3D space, in order to keep data sparsity. The scattered points augmented by multi-view geometric constraints and visual features are then employed to infer objects location and shape in the scene. To make up the limitations of single frame and model multi-view geometry explicitly, we further propose a surface filter module for noise suppression. Experimental results show that our method achieves significantly better performance than previous works in terms of 3D detection (more than 0.1 AP improvement on some categories of ScanNet). The code will be publicly available.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 6

research
09/22/2021

MVM3Det: A Novel Method for Multi-view Monocular 3D Detection

Monocular 3D object detection encounters occlusion problems in many appl...
research
07/25/2019

Simultaneous multi-view instance detection with learned geometric soft-constraints

We propose to jointly learn multi-view geometry and warping between view...
research
06/01/2018

CubeSLAM: Monocular 3D Object Detection and SLAM without Prior Models

We present a method for single image 3D cuboid object detection and mult...
research
10/04/2018

Multi-view X-ray R-CNN

Motivated by the detection of prohibited objects in carry-on luggage as ...
research
04/25/2023

MMRDN: Consistent Representation for Multi-View Manipulation Relationship Detection in Object-Stacked Scenes

Manipulation relationship detection (MRD) aims to guide the robot to gra...
research
08/31/2023

GHuNeRF: Generalizable Human NeRF from a Monocular Video

In this paper, we tackle the challenging task of learning a generalizabl...
research
09/30/2018

Marrying Tracking with ELM: A Metric Constraint Guided Multiple Feature Fusion Method

Object Tracking is one important problem in computer vision and surveill...

Please sign up or login with your details

Forgot password? Click here to reset