Silhouette-Net: 3D Hand Pose Estimation from Silhouettes

by   Kuo-Wei Lee, et al.

3D hand pose estimation has received a lot of attention for its wide range of applications and has made great progress owing to the development of deep learning. Existing approaches mainly consider different input modalities and settings, such as monocular RGB, multi-view RGB, depth, or point cloud, to provide sufficient cues for resolving variations caused by self occlusion and viewpoint change. In contrast, this work aims to address the less-explored idea of using minimal information to estimate 3D hand poses. We present a new architecture that automatically learns a guidance from implicit depth perception and solves the ambiguity of hand pose through end-to-end training. The experimental results show that 3D hand poses can be accurately estimated from solely hand silhouettes without using depth maps. Extensive evaluations on the 2017 Hands In the Million Challenge (HIM2017) benchmark dataset further demonstrate that our method achieves comparable or even better performance than recent depth-based approaches and serves as the state-of-the-art of its own kind on estimating 3D hand poses from silhouettes.


page 4

page 5

page 6


Hand Pose Estimation via Latent 2.5D Heatmap Regression

Estimating the 3D pose of a hand is an essential part of human-computer ...

DGGAN: Depth-image Guided Generative Adversarial Networks for Disentangling RGB and Depth Images in 3D Hand Pose Estimation

Estimating3D hand poses from RGB images is essentialto a wide range of p...

Transformer-based Global 3D Hand Pose Estimation in Two Hands Manipulating Objects Scenarios

This report describes our 1st place solution to ECCV 2022 challenge on H...

HandAugment: A Simple Data Augmentation for HANDS19 Challenge Task 1 – Depth-Based 3D Hand Pose Estimation

Hand pose estimation from 3D depth images, has been explored widely usin...

Mask2Hand: Learning to Predict the 3D Hand Pose and Shape from Shadow

We present a self-trainable method, Mask2Hand, which learns to solve the...

Pyramid Deep Fusion Network for Two-Hand Reconstruction from RGB-D Images

Accurately recovering the dense 3D mesh of both hands from monocular ima...

End-to-end Global to Local CNN Learning for Hand Pose Recovery in Depth data

Despite recent advances in 3D pose estimation of human hands, especially...

Please sign up or login with your details

Forgot password? Click here to reset