Hand Image Understanding via Deep Multi-Task Learning

07/24/2021
by Zhang Xiong, et al.

Analyzing and understanding hand information from multimedia materials such as images and videos is important for many real-world applications and remains an active research topic. Many works focus on recovering hand information from a single image. However, they usually solve a single task, for example hand mask segmentation, 2D/3D hand pose estimation, or hand mesh reconstruction, and they perform poorly in challenging scenarios. To further improve performance on these tasks, we propose a novel Hand Image Understanding (HIU) framework that extracts comprehensive information about the hand from a single RGB image by jointly considering the relationships between these tasks. To achieve this goal, we design a cascaded multi-task learning (MTL) backbone that estimates the 2D heat maps, learns the segmentation mask, and generates the intermediate 3D information encoding, followed by a coarse-to-fine learning paradigm and a self-supervised learning strategy. Qualitative experiments demonstrate that our approach recovers reasonable mesh representations even in challenging situations. Quantitatively, our method significantly outperforms state-of-the-art approaches on various widely used datasets across diverse evaluation metrics.
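The abstract does not state how the three backbone outputs are trained together. As a rough illustration of what joint supervision over heat maps, a segmentation mask, and a 3D encoding could look like, the sketch below combines one loss per task into a weighted sum. Everything in it is an assumption made for illustration: the function names, the choice of mean squared error and binary cross-entropy, the flat-list data layout, and the weights are hypothetical and are not taken from the paper.

```python
import math

def mse(pred, target):
    """Mean squared error between two equal-length flat lists."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)

def binary_cross_entropy(prob, label, eps=1e-7):
    """Mean binary cross-entropy for per-pixel mask probabilities."""
    total = 0.0
    for p, y in zip(prob, label):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(prob)

def multi_task_loss(outputs, targets, weights):
    """Weighted sum of per-task losses.

    The task names and weights are hypothetical; the paper's actual
    objective is not described in the abstract.
    """
    losses = {
        "heatmap": mse(outputs["heatmap"], targets["heatmap"]),
        "mask": binary_cross_entropy(outputs["mask"], targets["mask"]),
        "encoding": mse(outputs["encoding"], targets["encoding"]),
    }
    total = sum(weights[name] * value for name, value in losses.items())
    return total, losses
```

Returning the per-task losses alongside the total makes it easy to see which task dominates the objective when tuning the weights.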

research
07/29/2023

HandMIM: Pose-Aware Self-Supervised Learning for 3D Hand Mesh Estimation

With an enormous number of hand images generated over time, unleashing p...
research
04/25/2021

Parallel mesh reconstruction streams for pose estimation of interacting hands

We present a new multi-stream 3D mesh reconstruction network (MSMR-Net) ...
research
02/11/2021

Multi-Task Reinforcement Learning with Context-based Representations

The benefit of multi-task learning over single-task learning relies on t...
research
09/03/2021

Towards Accurate Alignment in Real-time 3D Hand-Mesh Reconstruction

3D hand-mesh reconstruction from RGB images facilitates many application...
research
05/31/2022

Mask2Hand: Learning to Predict the 3D Hand Pose and Shape from Shadow

We present a self-trainable method, Mask2Hand, which learns to solve the...
research
04/18/2022

End-to-end Weakly-supervised Multiple 3D Hand Mesh Reconstruction from Single Image

In this paper, we consider the challenging task of simultaneously locati...
research
11/03/2021

Unified 3D Mesh Recovery of Humans and Animals by Learning Animal Exercise

We propose an end-to-end unified 3D mesh recovery of humans and quadrupe...
