GeoGraph: Learning graph-based multi-view object detection with geometric cues end-to-end

03/23/2020
by   Ahmed Samy Nassar, et al.
37

In this paper we propose an end-to-end learnable approach that detects static urban objects from multiple views, re-identifies instances, and finally assigns a geographic position per object. Our method relies on a Graph Neural Network (GNN) to, detect all objects and output their geographic positions given images and approximate camera poses as input. Our GNN simultaneously models relative pose and image evidence, and is further able to deal with an arbitrary number of input views. Our method is robust to occlusion, with similar appearance of neighboring objects, and severe changes in viewpoints by jointly reasoning about visual image appearance and relative pose. Experimental evaluation on two challenging, large-scale datasets and comparison with state-of-the-art methods show significant and systematic improvements both in accuracy and efficiency, with 2-6 of training time.

READ FULL TEXT

page 2

page 4

page 6

page 7

page 10

research
07/25/2019

Simultaneous multi-view instance detection with learned geometric soft-constraints

We propose to jointly learn multi-view geometry and warping between view...
research
04/26/2023

Graph-CoVis: GNN-based Multi-view Panorama Global Pose Estimation

In this paper, we address the problem of wide-baseline camera pose estim...
research
08/29/2019

CorNet: Generic 3D Corners for 6D Pose Estimation of New Objects without Retraining

We present a novel approach to the detection and 3D pose estimation of o...
research
12/20/2020

Deep Continuous Fusion for Multi-Sensor 3D Object Detection

In this paper, we propose a novel 3D object detector that can exploit bo...
research
08/23/2021

ODAM: Object Detection, Association, and Mapping using Posed RGB Video

Localizing objects and estimating their extent in 3D is an important ste...
research
05/26/2022

Objects Matter: Learning Object Relation Graph for Robust Camera Relocalization

Visual relocalization aims to estimate the pose of a camera from one or ...
research
12/19/2021

End-to-End Learning of Multi-category 3D Pose and Shape Estimation

In this paper, we study the representation of the shape and pose of obje...

Please sign up or login with your details

Forgot password? Click here to reset