Real-time RGBD-based Extended Body Pose Estimation

03/05/2021
by Renat Bashirov, et al.

We present a system for real-time RGBD-based estimation of 3D human pose. We use a parametric 3D deformable human mesh model (SMPL-X) as the representation and focus on real-time estimation of the parameters for body pose, hand pose, and facial expression from a Kinect Azure RGB-D camera. We train estimators of body pose and facial expression parameters. Both estimators take the output of previously published landmark extractors as input and use custom annotated datasets for supervision, while hand pose is estimated directly by a previously published method. We combine the predictions of these estimators into a temporally smooth human pose. We train the facial expression extractor on a large talking-face dataset, which we annotate with facial expression parameters. For body pose, we collect and annotate a dataset of 56 people captured with a rig of 5 Kinect Azure RGB-D cameras and use it together with the large AMASS motion capture dataset. Our RGB-D body pose model outperforms state-of-the-art RGB-only methods and matches the accuracy of a slower RGB-D optimization-based solution. The combined system runs at 30 FPS on a server with a single GPU. The code will be available at https://saic-violet.github.io/rgbd-kinect-pose
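The abstract does not say how the per-frame predictions are merged and smoothed, so the following is only a minimal sketch of one common approach: concatenate the body, hand, and face parameter vectors and apply an exponential moving average over time. The names `FramePrediction` and `ExponentialPoseSmoother`, and the `alpha` parameter, are hypothetical and are not taken from the paper or its code.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class FramePrediction:
    """Per-frame outputs of three separate estimators (hypothetical layout)."""
    body: List[float]   # body pose parameters
    hands: List[float]  # hand pose parameters
    face: List[float]   # facial expression parameters


class ExponentialPoseSmoother:
    """Exponential moving average over concatenated parameters.

    Assumption: this is an illustrative stand-in, not the authors' method.
    Linear averaging of rotation parameters is only a rough approximation.
    """

    def __init__(self, alpha: float = 0.5) -> None:
        if not 0.0 < alpha <= 1.0:
            raise ValueError("alpha must be in (0, 1]")
        self.alpha = alpha
        self._state: Optional[List[float]] = None

    def update(self, pred: FramePrediction) -> List[float]:
        # Merge the three estimators' outputs into one parameter vector.
        combined = pred.body + pred.hands + pred.face
        if self._state is None:
            # First frame: nothing to smooth against yet.
            self._state = list(combined)
        else:
            a = self.alpha
            # Blend the new frame with the running estimate.
            self._state = [
                a * new + (1.0 - a) * old
                for new, old in zip(combined, self._state)
            ]
        return list(self._state)


if __name__ == "__main__":
    smoother = ExponentialPoseSmoother(alpha=0.5)
    print(smoother.update(FramePrediction([0.0], [2.0], [4.0])))
    print(smoother.update(FramePrediction([2.0], [2.0], [0.0])))
```

A smaller `alpha` gives smoother but more delayed output, which matters for a system targeting 30 FPS; a real implementation would likely treat rotations more carefully than a per-component average.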


research
04/11/2019

Expressive Body Capture: 3D Hands, Face, and Body from a Single Image

To facilitate the analysis of human actions, interactions and emotions, ...
research
10/11/2022

HiFECap: Monocular High-Fidelity and Expressive Capture of Human Performances

Monocular 3D human performance capture is indispensable for many applica...
research
05/12/2020

View-invariant Pose Analysis for Human Movement Assessment from RGB Data

We propose a CNN regression method to generate high-level, view-invaria...
research
11/22/2017

Cascaded 3D Full-body Pose Regression from Single Depth Image at 100 FPS

There are increasingly real-time live applications in virtual reality, w...
research
01/10/2017

Unite the People: Closing the Loop Between 3D and 2D Human Representations

3D models provide a common ground for different representations of human...
research
05/26/2019

EgoFace: Egocentric Face Performance Capture and Videorealistic Reenactment

Face performance capture and reenactment techniques use multiple cameras...
research
04/01/2020

BCNet: Learning Body and Cloth Shape from A Single Image

In this paper, we consider the problem to automatically reconstruct garm...
