Spectral Graphormer: Spectral Graph-based Transformer for Egocentric Two-Hand Reconstruction using Multi-View Color Images

08/21/2023
by   Tze Ho Elden Tse, et al.
0

We propose a novel transformer-based framework that reconstructs two high fidelity hands from multi-view RGB images. Unlike existing hand pose estimation methods, where one typically trains a deep network to regress hand model parameters from single RGB image, we consider a more challenging problem setting where we directly regress the absolute root poses of two-hands with extended forearm at high resolution from egocentric view. As existing datasets are either infeasible for egocentric viewpoints or lack background variations, we create a large-scale synthetic dataset with diverse scenarios and collect a real dataset from multi-calibrated camera setup to verify our proposed multi-view image feature fusion strategy. To make the reconstruction physically plausible, we propose two strategies: (i) a coarse-to-fine spectral graph convolution decoder to smoothen the meshes during upsampling and (ii) an optimisation-based refinement stage at inference to prevent self-penetrations. Through extensive quantitative and qualitative evaluations, we show that our framework is able to produce realistic two-hand reconstructions and demonstrate the generalisation of synthetic-trained models to real data, as well as real-time AR/VR applications.

READ FULL TEXT

page 1

page 3

page 6

page 8

page 9

page 14

page 17

page 18

research
07/02/2019

HO-3D: A Multi-User, Multi-Object Dataset for Joint 3D Hand-Object Pose Estimation

We propose a new dataset for 3D hand+object pose estimation from color i...
research
10/31/2022

UmeTrack: Unified multi-view end-to-end hand tracking for VR

Real-time tracking of 3D hand pose in world space is a challenging probl...
research
10/17/2017

Real-time marker-less multi-person 3D pose estimation in RGB-Depth camera networks

This paper proposes a novel system to estimate and track the 3D poses of...
research
09/13/2021

Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images

This paper studies the task of estimating the 3D human poses of multiple...
research
07/05/2022

Array Camera Image Fusion using Physics-Aware Transformers

We demonstrate a physics-aware transformer for feature-based data fusion...
research
03/17/2023

ShaRPy: Shape Reconstruction and Hand Pose Estimation from RGB-D with Uncertainty

Despite their potential, markerless hand tracking technologies are not y...
research
04/04/2023

Learning to Recover Spectral Reflectance from RGB Images

This paper tackles spectral reflectance recovery (SRR) from RGB images. ...

Please sign up or login with your details

Forgot password? Click here to reset