See You Soon: Decoupled Iterative Refinement Framework for Interacting Hands Reconstruction from a Single RGB Image

02/05/2023
by Pengfei Ren, et al.

Reconstructing interacting hands from a single RGB image is a very challenging task. One difficulty is that severe mutual occlusion and the similar local appearance of the two hands confuse the extraction of visual features, so the estimated hand meshes end up misaligned with the image. Another is that interacting hands exhibit complex interaction patterns, which significantly enlarges the solution space of hand poses and makes the network harder to train. In this paper, we propose a decoupled iterative refinement framework that achieves pixel-aligned hand reconstruction while efficiently modeling the spatial relationship between the hands. Specifically, we define two feature spaces with different characteristics: a 2D visual feature space and a 3D joint feature space. First, we obtain joint-wise features from the visual feature map and use a graph convolution network and a transformer to perform intra-hand and inter-hand information interaction, respectively, in the 3D joint feature space. Then, we project the joint features, which now carry global information, back into the 2D visual feature space in an obfuscation-free manner and apply 2D convolution for pixel-wise enhancement. By alternating enhancement between the two feature spaces multiple times, our method achieves accurate and robust reconstruction of interacting hands. Our method outperforms all existing two-hand reconstruction methods by a large margin on the InterHand2.6M dataset. It also shows strong generalization to in-the-wild images.
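The alternation between the two feature spaces can be illustrated with a minimal NumPy sketch of a single refinement pass. This is not the authors' implementation: every function name here is hypothetical, the graph convolution and attention use no learned projections beyond one weight matrix, a Gaussian splat stands in for the paper's obfuscation-free projection (whose details the abstract does not give), and the final 2D convolution is replaced by a plain residual add.

```python
import numpy as np

def sample_joint_features(feat, joints_uv):
    """Bilinearly sample a (C, H, W) map at (J, 2) pixel coords (u = x, v = y)."""
    _, H, W = feat.shape
    u = np.clip(joints_uv[:, 0], 0, W - 1)
    v = np.clip(joints_uv[:, 1], 0, H - 1)
    u0, v0 = np.floor(u).astype(int), np.floor(v).astype(int)
    u1, v1 = np.minimum(u0 + 1, W - 1), np.minimum(v0 + 1, H - 1)
    du, dv = u - u0, v - v0
    top = feat[:, v0, u0] * (1 - du) + feat[:, v0, u1] * du   # (C, J)
    bot = feat[:, v1, u0] * (1 - du) + feat[:, v1, u1] * du   # (C, J)
    return (top * (1 - dv) + bot * dv).T                      # (J, C)

def graph_conv(x, adj, weight):
    """Intra-hand step: one symmetric-normalized graph convolution with ReLU."""
    a = adj + np.eye(adj.shape[0])          # add self-loops
    d = a.sum(axis=1)
    a_norm = a / np.sqrt(np.outer(d, d))    # D^-1/2 (A + I) D^-1/2
    return np.maximum(a_norm @ x @ weight, 0.0)

def cross_hand_attention(query, context):
    """Inter-hand step: single-head scaled dot-product attention, residual."""
    scores = query @ context.T / np.sqrt(query.shape[1])
    scores = scores - scores.max(axis=1, keepdims=True)   # numerical stability
    w = np.exp(scores)
    w = w / w.sum(axis=1, keepdims=True)
    return query + w @ context

def splat_to_feature_map(joint_feat, joints_uv, height, width, sigma=1.5):
    """Assumed stand-in for the 3D-to-2D projection: Gaussian splat per joint."""
    ys, xs = np.mgrid[0:height, 0:width]
    d2 = ((xs[None] - joints_uv[:, 0, None, None]) ** 2
          + (ys[None] - joints_uv[:, 1, None, None]) ** 2)
    heat = np.exp(-d2 / (2.0 * sigma ** 2))                # (J, H, W)
    return np.einsum("jc,jhw->chw", joint_feat, heat)      # (C, H, W)

def refine_once(feat, uv_left, uv_right, adj, w_gcn):
    """One pass: 2D map -> joint features -> intra/inter-hand mixing -> 2D map."""
    left = graph_conv(sample_joint_features(feat, uv_left), adj, w_gcn)
    right = graph_conv(sample_joint_features(feat, uv_right), adj, w_gcn)
    left_out = cross_hand_attention(left, right)
    right_out = cross_hand_attention(right, left)
    _, H, W = feat.shape
    # A learned 2D convolution would follow here; the sketch only adds residually.
    return (feat
            + splat_to_feature_map(left_out, uv_left, H, W)
            + splat_to_feature_map(right_out, uv_right, H, W))

# Toy usage: 3 joints per hand on a chain skeleton, 8-channel 16x16 feature map.
rng = np.random.default_rng(0)
feat = rng.standard_normal((8, 16, 16))
adj = np.array([[0, 1, 0], [1, 0, 1], [0, 1, 0]], dtype=float)
uv_left = np.array([[3.0, 4.0], [4.5, 5.5], [6.0, 7.0]])
uv_right = uv_left + 5.0
for _ in range(2):  # "multiple alternate enhancements"
    feat = refine_once(feat, uv_left, uv_right, adj, np.eye(8))
```

In a real system the joint coordinates would presumably be re-estimated after each pass, so later passes sample better-aligned features; the sketch keeps them fixed for brevity.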


research
03/17/2022

Interacting Attention Graph for Single Image Two-Hand Reconstruction

Graph convolutional network (GCN) has achieved great success in single h...
research
04/07/2023

A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation from a Single RGB Image

3D interacting hand pose estimation from a single RGB image is a challen...
research
03/10/2023

ACR: Attention Collaboration-based Regressor for Arbitrary Two-Hand Reconstruction

Reconstructing two hands from monocular RGB images is challenging due to...
research
08/08/2023

Exploiting Spatial-Temporal Context for Interacting Hand Reconstruction on Monocular RGB Video

Reconstructing interacting hands from monocular RGB data is a challengin...
research
08/27/2023

Reconstructing Interacting Hands with Interaction Prior from Monocular Images

Reconstructing interacting hands from monocular images is indispensable ...
research
08/21/2022

LWA-HAND: Lightweight Attention Hand for Interacting Hand Reconstruction

Hand reconstruction has achieved great success in real-time applications...
research
07/01/2021

Learning to Disambiguate Strongly Interacting Hands via Probabilistic Per-pixel Part Segmentation

In natural conversation and interaction, our hands often overlap or are ...
