Body2Hands: Learning to Infer 3D Hands from Conversational Gesture Body Dynamics

07/23/2020
by Evonne Ng, et al.

We propose a novel learned deep prior of body motion for 3D hand shape synthesis and estimation in the domain of conversational gestures. Our model builds upon the insight that body motion and hand gestures are strongly correlated in non-verbal communication settings. We formulate the learning of this prior as a prediction task of 3D hand shape over time given body motion input alone. Trained with 3D pose estimations obtained from a large-scale dataset of internet videos, our hand prediction model produces convincing 3D hand gestures given only the 3D motion of the speaker's arms as input. We demonstrate the efficacy of our method on hand gesture synthesis from body motion input, and as a strong body prior for single-view image-based 3D hand pose estimation. We demonstrate that our method outperforms previous state-of-the-art approaches and can generalize beyond the monologue-based training data to multi-person conversations. Video results are available at http://people.eecs.berkeley.edu/~evonne_ng/projects/body2hands/.


