Exploiting Spatial-Temporal Context for Interacting Hand Reconstruction on Monocular RGB Video

08/08/2023
by   Weichao Zhao, et al.
0

Reconstructing interacting hands from monocular RGB data is a challenging task, as it involves many interfering factors, e.g. self- and mutual occlusion and similar textures. Previous works only leverage information from a single RGB image without modeling their physically plausible relation, which leads to inferior reconstruction results. In this work, we are dedicated to explicitly exploiting spatial-temporal information to achieve better interacting hand reconstruction. On one hand, we leverage temporal context to complement insufficient information provided by the single frame, and design a novel temporal framework with a temporal constraint for interacting hand motion smoothness. On the other hand, we further propose an interpenetration detection module to produce kinetically plausible interacting hands without physical collisions. Extensive experiments are performed to validate the effectiveness of our proposed framework, which achieves new state-of-the-art performance on public benchmarks.

READ FULL TEXT

page 1

page 4

page 9

page 10

page 12

page 13

research
11/01/2021

Monocular 3D Reconstruction of Interacting Hands via Collision-Aware Factorized Refinements

3D interacting hand reconstruction is essential to facilitate human-mach...
research
03/10/2023

ACR: Attention Collaboration-based Regressor for Arbitrary Two-Hand Reconstruction

Reconstructing two hands from monocular RGB images is challenging due to...
research
02/05/2023

See You Soon: Decoupled Iterative Refinement Framework for Interacting Hands Reconstruction from a Single RGB Image

Reconstructing interacting hands from a single RGB image is a very chall...
research
04/07/2023

A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation from a Single RGB Image

3D interacting hand pose estimation from a single RGB image is a challen...
research
04/24/2023

gSDF: Geometry-Driven Signed Distance Functions for 3D Hand-Object Reconstruction

Signed distance functions (SDFs) is an attractive framework that has rec...
research
08/21/2022

LWA-HAND: Lightweight Attention Hand for Interacting Hand Reconstruction

Hand reconstruction has achieved great success in real-time applications...
research
07/27/2023

Physically Plausible 3D Human-Scene Reconstruction from Monocular RGB Image using an Adversarial Learning Approach

Holistic 3D human-scene reconstruction is a crucial and emerging researc...

Please sign up or login with your details

Forgot password? Click here to reset