Mask2Hand: Learning to Predict the 3D Hand Pose and Shape from Shadow

05/31/2022
by   Li-Jen Chang, et al.
0

We present a self-trainable method, Mask2Hand, which learns to solve the challenging task of predicting 3D hand pose and shape from a 2D binary mask of hand silhouette/shadow without additional manually-annotated data. Given the intrinsic camera parameters and the parametric hand model in the camera space, we adopt the differentiable rendering technique to project 3D estimations onto the 2D binary silhouette space. By applying a tailored combination of losses between the rendered silhouette and the input binary mask, we are able to integrate the self-guidance mechanism into our end-to-end optimization process for constraining global mesh registration and hand pose estimation. The experiments show that our method, which takes a single binary mask as the input, can achieve comparable prediction accuracy on both unaligned and aligned settings as state-of-the-art methods that require RGB or depth inputs.

READ FULL TEXT

page 2

page 21

page 22

page 23

research
04/08/2019

Pushing the Envelope for RGB-based Dense 3D Hand Pose Estimation via Neural Rendering

Estimating 3D hand meshes from single RGB images is challenging, due to ...
research
08/28/2018

DeepHPS: End-to-end Estimation of 3D Hand Pose and Shape by Learning from Synthetic Depth

Articulated hand pose and shape estimation is an important problem for v...
research
11/29/2017

Occlusion-aware Hand Pose Estimation Using Hierarchical Mixture Density Network

Hand pose estimation is to predict the pose parameters representing a 3D...
research
12/28/2019

Silhouette-Net: 3D Hand Pose Estimation from Silhouettes

3D hand pose estimation has received a lot of attention for its wide ran...
research
05/07/2023

Neural Voting Field for Camera-Space 3D Hand Pose Estimation

We present a unified framework for camera-space 3D hand pose estimation ...
research
07/24/2021

Hand Image Understanding via Deep Multi-Task Learning

Analyzing and understanding hand information from multimedia materials l...
research
09/25/2021

Fully Differentiable and Interpretable Model for VIO with 4 Trainable Parameters

Monocular visual-inertial odometry (VIO) is a critical problem in roboti...

Please sign up or login with your details

Forgot password? Click here to reset