Consistent 3D Hand Reconstruction in Video via self-supervised Learning

01/24/2022
by   Zhigang Tu, et al.
13

We present a method for reconstructing accurate and consistent 3D hands from a monocular video. We observe that detected 2D hand keypoints and the image texture provide important cues about the geometry and texture of the 3D hand, which can reduce or even eliminate the requirement on 3D hand annotation. Thus we propose S^2HAND, a self-supervised 3D hand reconstruction model, that can jointly estimate pose, shape, texture, and the camera viewpoint from a single RGB input through the supervision of easily accessible 2D detected keypoints. We leverage the continuous hand motion information contained in the unlabeled video data and propose S^2HAND(V), which uses a set of weights shared S^2HAND to process each frame and exploits additional motion, texture, and shape consistency constrains to promote more accurate hand poses and more consistent shapes and textures. Experiments on benchmark datasets demonstrate that our self-supervised approach produces comparable hand reconstruction performance compared with the recent full-supervised methods in single-frame as input setup, and notably improves the reconstruction accuracy and consistency when using video training data.

READ FULL TEXT

page 1

page 4

page 8

page 11

page 12

research
03/22/2021

Model-based 3D Hand Reconstruction via Self-Supervised Learning

Reconstructing a 3D hand from a single-view RGB image is challenging due...
research
08/25/2023

HiFiHR: Enhancing 3D Hand Reconstruction from a Single Image via High-Fidelity Texture

We present HiFiHR, a high-fidelity hand reconstruction approach that uti...
research
11/20/2019

Self-supervised Learning of 3D Objects from Natural Images

We present a method to learn single-view reconstruction of the 3D shape,...
research
03/04/2022

Time-to-Label: Temporal Consistency for Self-Supervised Monocular 3D Object Detection

Monocular 3D object detection continues to attract attention due to the ...
research
05/07/2020

Vid2Curve: Simultaneous Camera Motion Estimation and Thin Structure Reconstruction from an RGB Video

Thin structures, such as wire-frame sculptures, fences, cables, power li...
research
04/27/2022

3D Magic Mirror: Clothing Reconstruction from a Single Image via a Causal Perspective

This research aims to study a self-supervised 3D clothing reconstruction...
research
11/23/2022

Hand Avatar: Free-Pose Hand Animation and Rendering from Monocular Video

We present HandAvatar, a novel representation for hand animation and ren...

Please sign up or login with your details

Forgot password? Click here to reset