Learning to Navigate the Energy Landscape

03/18/2016
by Julien Valentin, et al.

In this paper, we present a novel and efficient architecture for addressing computer vision problems that use "Analysis by Synthesis". Analysis by synthesis involves minimizing the reconstruction error, which is typically a non-convex function of the latent target variables. State-of-the-art methods adopt a hybrid scheme in which discriminatively trained predictors, such as Random Forests or Convolutional Neural Networks, are used to initialize local search algorithms. While these methods have been shown to produce promising results, they often get stuck in local optima. Our method goes beyond the conventional hybrid architecture by not only proposing multiple accurate initial solutions but also defining a navigational structure over the solution space that can be used for extremely efficient gradient-free local search. We demonstrate the efficacy of our approach on the challenging problem of RGB camera relocalization. To make the RGB camera relocalization problem particularly challenging, we introduce a new dataset of 3D environments that are significantly larger than those found in other publicly available datasets. Our experiments reveal that the proposed method achieves state-of-the-art camera relocalization results. We also demonstrate the generalizability of our approach on hand pose estimation and image retrieval tasks.
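The combination of several initial solutions with gradient-free search over a structure on the solution space can be illustrated with a toy sketch. This is not the architecture proposed in the paper: it assumes a finite set of candidate solutions, a precomputed neighbor graph over them, and plain greedy descent. All names (`greedy_graph_descent`, `toy_energy`, `ENERGIES`, `NEIGHBORS`) are hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict, Hashable, Iterable, List, Tuple

Node = Hashable


def greedy_graph_descent(
    energy: Callable[[Node], float],
    neighbors: Dict[Node, List[Node]],
    seeds: Iterable[Node],
    max_steps: int = 100,
) -> Tuple[Node, float]:
    """Gradient-free local search over a neighbor graph, run from several seeds.

    From each seed, repeatedly move to the lowest-energy neighbor until no
    neighbor improves on the current node. Return the best node found overall.
    """
    best_node, best_energy = None, float("inf")
    for seed in seeds:
        current, current_energy = seed, energy(seed)
        for _ in range(max_steps):
            # Score every graph neighbor of the current candidate.
            scored = [(energy(n), n) for n in neighbors.get(current, [])]
            if not scored:
                break
            next_energy, next_node = min(scored, key=lambda pair: pair[0])
            if next_energy >= current_energy:
                break  # local optimum with respect to the graph
            current, current_energy = next_node, next_energy
        if current_energy < best_energy:
            best_node, best_energy = current, current_energy
    return best_node, best_energy


# Toy non-convex energy over ten candidates arranged on a line (assumed data).
ENERGIES = [5, 3, 4, 6, 7, 4, 2, 1, 2, 6]
NEIGHBORS = {
    i: [j for j in (i - 1, i + 1) if 0 <= j < len(ENERGIES)]
    for i in range(len(ENERGIES))
}


def toy_energy(node: int) -> float:
    """Stand-in for a reconstruction error evaluated at one candidate."""
    return ENERGIES[node]


single_seed = greedy_graph_descent(toy_energy, NEIGHBORS, seeds=[0])
multi_seed = greedy_graph_descent(toy_energy, NEIGHBORS, seeds=[0, 5])
```

In this toy setting, the search from seed 0 alone stalls at node 1 (energy 3), a local optimum, while adding seed 5 lets the search reach node 7 (energy 1). This mirrors, in a very simplified way, why multiple initial solutions help a local search on a non-convex energy.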


