LWA-HAND: Lightweight Attention Hand for Interacting Hand Reconstruction

08/21/2022
by Xinhan Di, et al.

Hand reconstruction has achieved great success in real-time applications such as virtual reality and augmented reality, while interacting two-hand reconstruction through efficient transformers is left unexplored. In this paper, we propose a method called lightweight attention hand (LWA-HAND) to reconstruct hands at low FLOPs from a single RGB image. To solve the occlusion and interaction challenges in efficient attention architectures, we introduce three mobile attention modules. The first module is a lightweight feature attention module that extracts both local occlusion representations and global image patch representations in a coarse-to-fine manner. The second module is a cross image and graph bridge module that fuses image context and hand vertices. The third module is a lightweight cross-attention mechanism that uses element-wise operations to compute cross attention between the two hands in linear complexity. The resulting model achieves performance comparable to state-of-the-art models on the InterHand2.6M benchmark. At the same time, it reduces computation to 0.47 GFLOPs, whereas state-of-the-art models require between 10 and 20 GFLOPs.
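The abstract does not spell out how the third module is formulated, so the following is only an illustrative sketch of one way element-wise operations can give cross attention in linear complexity. It is not the authors' implementation. The function name `elementwise_cross_attention`, the weight names, and the pooling scheme (one hand's features are pooled into a single context vector that modulates the other hand's features) are all assumptions made for illustration.

```python
import numpy as np


def softmax(x, axis):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def elementwise_cross_attention(query_feats, context_feats, w_score, w_key, w_value):
    """Hypothetical linear-complexity cross attention between two hands.

    query_feats:   (N, d) features of the hand being updated
    context_feats: (M, d) features of the other hand
    w_score: (d,)   projects each context token to a scalar score
    w_key:   (d, d) projects context tokens before pooling
    w_value: (d, d) projects query tokens

    No (N, M) attention matrix is formed, so the cost grows
    linearly with N + M instead of with N * M.
    """
    # One scalar weight per context token, normalised over the M tokens.
    scores = softmax(context_feats @ w_score, axis=0)               # (M,)
    # Weighted sum of projected context tokens: a single context vector.
    context = (scores[:, None] * (context_feats @ w_key)).sum(axis=0)  # (d,)
    # Project the query tokens, apply ReLU, and modulate them
    # element-wise by the broadcast context vector.
    values = np.maximum(query_feats @ w_value, 0.0)                 # (N, d)
    return values * context                                         # (N, d)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    d = 8
    left = rng.normal(size=(4, d))    # toy features for one hand
    right = rng.normal(size=(6, d))   # toy features for the other hand
    out = elementwise_cross_attention(
        left, right,
        rng.normal(size=d), rng.normal(size=(d, d)), rng.normal(size=(d, d)),
    )
    print(out.shape)  # (4, 8)
```

Standard cross attention would compute an N-by-M score matrix between the two hands' tokens. In this sketch each hand's tokens are only projected and pooled once, which is the kind of saving the abstract's "linear complexity" claim refers to.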


research
03/17/2022

Interacting Attention Graph for Single Image Two-Hand Reconstruction

Graph convolutional network (GCN) has achieved great success in single h...
research
02/28/2023

Im2Hands: Learning Attentive Implicit Representation of Interacting Two-Hand Shapes

We present Implicit Two Hands (Im2Hands), the first neural implicit repr...
research
03/10/2023

ACR: Attention Collaboration-based Regressor for Arbitrary Two-Hand Reconstruction

Reconstructing two hands from monocular RGB images is challenging due to...
research
02/05/2023

See You Soon: Decoupled Iterative Refinement Framework for Interacting Hands Reconstruction from a Single RGB Image

Reconstructing interacting hands from a single RGB image is a very chall...
research
08/08/2023

Exploiting Spatial-Temporal Context for Interacting Hand Reconstruction on Monocular RGB Video

Reconstructing interacting hands from monocular RGB data is a challengin...
research
10/25/2021

Highly Efficient Natural Image Matting

Over the last few years, deep learning based approaches have achieved ou...
research
08/05/2019

3D Reconstruction of Deformable Revolving Object under Heavy Hand Interaction

We reconstruct 3D deformable object through time, in the context of a li...
