HandFlow: Quantifying View-Dependent 3D Ambiguity in Two-Hand Reconstruction with Normalizing Flow

10/04/2022
by   Jiayi Wang, et al.
19

Reconstructing two-hand interactions from a single image is a challenging problem due to ambiguities that stem from projective geometry and heavy occlusions. Existing methods are designed to estimate only a single pose, despite the fact that there exist other valid reconstructions that fit the image evidence equally well. In this paper we propose to address this issue by explicitly modeling the distribution of plausible reconstructions in a conditional normalizing flow framework. This allows us to directly supervise the posterior distribution through a novel determinant magnitude regularization, which is key to varied 3D hand pose samples that project well into the input image. We also demonstrate that metrics commonly used to assess reconstruction quality are insufficient to evaluate pose predictions under such severe ambiguity. To address this, we release the first dataset with multiple plausible annotations per image called MultiHands. The additional annotations enable us to evaluate the estimated distribution using the maximum mean discrepancy metric. Through this, we demonstrate the quality of our probabilistic reconstruction and show that explicit ambiguity modeling is better-suited for this challenging problem.

READ FULL TEXT

page 1

page 3

page 5

page 7

research
03/22/2021

Model-based 3D Hand Reconstruction via Self-Supervised Learning

Reconstructing a 3D hand from a single-view RGB image is challenging due...
research
09/14/2023

HandNeRF: Learning to Reconstruct Hand-Object Interaction Scene from a Single RGB Image

This paper presents a method to learn hand-object interaction prior for ...
research
07/08/2020

Adaptive 3D Face Reconstruction from a Single Image

3D face reconstruction from a single image is a challenging problem, esp...
research
12/02/2016

A Point Set Generation Network for 3D Object Reconstruction from a Single Image

Generation of 3D data by deep neural network has been attracting increas...
research
06/01/2023

BUOL: A Bottom-Up Framework with Occupancy-aware Lifting for Panoptic 3D Scene Reconstruction From A Single Image

Understanding and modeling the 3D scene from a single image is a practic...
research
08/19/2021

How to cheat with metrics in single-image HDR reconstruction

Single-image high dynamic range (SI-HDR) reconstruction has recently eme...
research
12/05/2021

Deblurring via Stochastic Refinement

Image deblurring is an ill-posed problem with multiple plausible solutio...

Please sign up or login with your details

Forgot password? Click here to reset