6D Camera Relocalization in Ambiguous Scenes via Continuous Multimodal Inference

04/09/2020
by   Mai Bui, et al.
10

We present a multimodal camera relocalization framework that captures ambiguities and uncertainties with continuous mixture models defined on the manifold of camera poses. In highly ambiguous environments, which can easily arise due to symmetries and repetitive structures in the scene, computing one plausible solution (what most state-of-the-art methods currently regress) may not be sufficient. Instead we predict multiple camera pose hypotheses as well as the respective uncertainty for each prediction. Towards this aim, we use Bingham distributions, to model the orientation of the camera pose, and a multivariate Gaussian to model the position, with an end-to-end deep neural network. By incorporating a Winner-Takes-All training scheme, we finally obtain a mixture model that is well suited for explaining ambiguities in the scene, yet does not suffer from mode collapse, a common problem with mixture density networks. We introduce a new dataset specifically designed to foster camera localization research in ambiguous environments and exhaustively evaluate our method on synthetic as well as real data on both ambiguous scenes and on non-ambiguous benchmark datasets. We plan to release our code and dataset under $\href{https://multimodal3dvision.github.io}{multimodal3dvision.github.io}$.

READ FULL TEXT

page 2

page 12

page 13

page 21

page 23

page 24

page 27

page 28

research
01/05/2023

A Probabilistic Framework for Visual Localization in Ambiguous Scenes

Visual localization allows autonomous robots to relocalize when losing t...
research
12/20/2020

Deep Bingham Networks: Dealing with Uncertainty and Ambiguity in Pose Estimation

In this work, we introduce Deep Bingham Networks (DBN), a generic framew...
research
07/13/2022

6D Camera Relocalization in Visually Ambiguous Extreme Environments

We propose a novel method to reliably estimate the pose of a camera give...
research
06/09/2019

Overcoming Limitations of Mixture Density Networks: A Sampling and Fitting Framework for Multimodal Future Prediction

Future prediction is a fundamental principle of intelligence that helps ...
research
03/21/2021

Learning Multi-Scene Absolute Pose Regression with Transformers

Absolute camera pose regressors estimate the position and orientation of...
research
06/24/2020

GMMLoc: Structure Consistent Visual Localization with Gaussian Mixture Models

Incorporating prior structure information into the visual state estimati...
research
11/02/2020

3D Multi-bodies: Fitting Sets of Plausible 3D Human Models to Ambiguous Image Data

We consider the problem of obtaining dense 3D reconstructions of humans ...

Please sign up or login with your details

Forgot password? Click here to reset