Hand Pose Estimation via Multiview Collaborative Self-Supervised Learning

02/02/2023
by   Xiaozheng Zheng, et al.
9

3D hand pose estimation has made significant progress in recent years. However, the improvement is highly dependent on the emergence of large-scale annotated datasets. To alleviate the label-hungry limitation, we propose a multi-view collaborative self-supervised learning framework, HaMuCo, that estimates hand pose only with pseudo labels for training. We use a two-stage strategy to tackle the noisy label challenge and the multi-view “groupthink” problem. In the first stage, we estimate the 3D hand poses for each view independently. In the second stage, we employ a cross-view interaction network to capture the cross-view correlated features and use multi-view consistency loss to achieve collaborative learning among views. To further enhance the collaboration between single-view and multi-view, we fuse the results of all views to supervise the single-view network. To summarize, we introduce collaborative learning in two folds, the cross-view level and the multi- to single-view level. Extensive experiments show that our method can achieve state-of-the-art performance on multi-view self-supervised hand pose estimation. Moreover, ablation studies verify the effectiveness of each component. Results on multiple datasets further demonstrate the generalization ability of our network.

READ FULL TEXT

page 1

page 8

page 12

page 13

page 14

page 15

page 16

page 17

research
10/13/2020

Self-Supervised Multi-View Synchronization Learning for 3D Pose Estimation

Current state-of-the-art methods cast monocular 3D human pose estimation...
research
12/27/2021

Active Learning with Pseudo-Labels for Multi-View 3D Pose Estimation

Pose estimation of the human body/hand is a fundamental problem in compu...
research
04/22/2023

Self-supervised Learning by View Synthesis

We present view-synthesis autoencoders (VSA) in this paper, which is a s...
research
06/23/2016

Robust 3D Hand Pose Estimation in Single Depth Images: from Single-View CNN to Multi-View CNNs

Articulated hand pose estimation plays an important role in human-comput...
research
09/29/2016

Multi-view Self-supervised Deep Learning for 6D Pose Estimation in the Amazon Picking Challenge

Robot warehouse automation has attracted significant interest in recent ...
research
01/06/2022

Enhancing Egocentric 3D Pose Estimation with Third Person Views

In this paper, we propose a novel approach to enhance the 3D body pose e...
research
09/24/2021

Multi-View Video-Based 3D Hand Pose Estimation

Hand pose estimation (HPE) can be used for a variety of human-computer i...

Please sign up or login with your details

Forgot password? Click here to reset