I Like to Move It: 6D Pose Estimation as an Action Decision Process

09/26/2020
by   Benjamin Busam, et al.

Object pose estimation is an integral part of robot vision and augmented reality. Robust and accurate prediction of both object rotation and translation is crucial to enable precise and safe human-machine interaction and to allow visualization in mixed reality. Previous 6D pose estimation methods either treat the problem as a regression task or discretize the pose space and classify. We reformulate the problem as an action decision process in which an initial pose is updated in incremental discrete steps that sequentially move a virtual 3D rendering towards the correct solution. A neural network iteratively estimates likely moves from a single RGB image and thereby determines an acceptable final pose. In contrast to previous approaches that learn an object-specific pose embedding, a decision process allows for a lightweight architecture and naturally generalizes to unseen objects. Moreover, the coherent action for process termination enables dynamic reduction of the computation cost when changes in a video sequence are insignificant. While other methods only provide a static inference time, we can thereby automatically adjust the runtime depending on the object motion. We evaluate the robustness and accuracy of our action decision network on video scenes with known and unknown objects and show how it can significantly improve the state of the art on YCB videos.
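The iterative update described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the paper's implementation: the 13-way action set (one positive and one negative step per pose dimension plus a stop action), the fixed step size, the additive treatment of rotation parameters, and in particular `oracle_policy`, a hypothetical stand-in that reads the true pose where the paper uses a neural network looking at an RGB image and a rendering.

```python
from typing import List, Tuple

# Assumed action set: +/- one step along each of six pose parameters
# (tx, ty, tz, rx, ry, rz), followed by a single "stop" action.
NUM_DIMS = 6
STOP = 2 * NUM_DIMS


def apply_action(pose: List[float], action: int, step: float) -> List[float]:
    """Return a copy of `pose` moved by one discrete step."""
    dim, sign = divmod(action, 2)
    new_pose = list(pose)
    new_pose[dim] += step if sign == 0 else -step
    return new_pose


def oracle_policy(pose: List[float], target: List[float], step: float) -> int:
    """Hypothetical stand-in for the network: move along the parameter with
    the largest remaining error, or stop once all errors are within half a step."""
    errors = [t - p for p, t in zip(pose, target)]
    dim = max(range(NUM_DIMS), key=lambda i: abs(errors[i]))
    if abs(errors[dim]) <= step / 2:
        return STOP
    return 2 * dim + (0 if errors[dim] > 0 else 1)


def refine_pose(initial_pose: List[float], target: List[float],
                step: float = 0.1, max_steps: int = 1000) -> Tuple[List[float], int]:
    """Apply discrete moves until the policy chooses STOP.

    Returns the final pose and the number of moves that were applied."""
    pose = list(initial_pose)
    for n in range(max_steps):
        action = oracle_policy(pose, target, step)
        if action == STOP:
            return pose, n
        pose = apply_action(pose, action, step)
    return pose, max_steps


if __name__ == "__main__":
    target = [0.3, -0.2, 0.0, 0.0, 0.0, 0.1]
    pose, moves = refine_pose([0.0] * NUM_DIMS, target)
    print(pose, moves)
```

In this sketch the number of moves depends on how far the initial pose is from the target, and a call that starts at the target stops immediately without applying any move. That mirrors, in a simplified way, the variable runtime the abstract attributes to the termination action.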


research
06/25/2018

Vision-based Pose Estimation for Augmented Reality : A Comparison Study

Augmented reality aims to enrich our real world by inserting 3D virtual ...
research
03/31/2018

DeepIM: Deep Iterative Matching for 6D Pose Estimation

Estimating the 6D pose of objects from images is an important problem in...
research
11/17/2021

MPF6D: Masked Pyramid Fusion 6D Pose Estimation

Object pose estimation has multiple important applications, such as robo...
research
05/08/2018

A Mixed Classification-Regression Framework for 3D Pose Estimation from 2D Images

3D pose estimation from a single 2D image is an important and challengin...
research
05/23/2022

Keypoint-Based Category-Level Object Pose Tracking from an RGB Sequence with Uncertainty Estimation

We propose a single-stage, category-level 6-DoF pose estimation algorith...
research
12/27/2019

One Point, One Object: Simultaneous 3D Object Segmentation and 6-DOF Pose Estimation

We propose a single-shot method for simultaneous 3D object segmentation ...
research
08/05/2019

TopoTag: A Robust and Scalable Topological Fiducial Marker System

Fiducial markers have been playing an important role in augmented realit...
