VNect: Real-time 3D Human Pose Estimation with a Single RGB Camera

05/03/2017
by   Dushyant Mehta, et al.
0

We present the first real-time method to capture the full global 3D skeletal pose of a human in a stable, temporally consistent manner using a single RGB camera. Our method combines a new convolutional neural network (CNN) based pose regressor with kinematic skeleton fitting. Our novel fully-convolutional pose formulation regresses 2D and 3D joint positions jointly in real time and does not require tightly cropped input frames. A real-time kinematic skeleton fitting method uses the CNN output to yield temporally stable 3D global pose reconstructions on the basis of a coherent kinematic skeleton. This makes our approach the first monocular RGB method usable in real-time applications such as 3D character control---thus far, the only monocular methods for such applications employed specialized RGB-D cameras. Our method's accuracy is quantitatively on par with the best offline 3D monocular RGB pose estimation methods. Our results are qualitatively comparable to, and sometimes better than, results from monocular RGB-D approaches, such as the Kinect. However, we show that our approach is more broadly applicable than RGB-D solutions, i.e. it works for outdoor scenes, community videos, and low quality commodity RGB cameras.

READ FULL TEXT

page 1

page 5

page 7

page 8

page 10

page 11

research
07/01/2019

XNect: Real-time Multi-person 3D Human Pose Estimation with a Single RGB Camera

We present a real-time approach for multi-person 3D motion capture at ov...
research
12/11/2017

Using a single RGB frame for real time 3D hand pose estimation in the wild

We present a method for the real-time estimation of the full 3D pose of ...
research
06/22/2020

MotioNet: 3D Human Motion Reconstruction from Monocular Video with Skeleton Consistency

We introduce MotioNet, a deep neural network that directly reconstructs ...
research
12/04/2017

GANerated Hands for Real-time 3D Hand Tracking from Monocular RGB

We address the highly challenging problem of real-time 3D hand tracking ...
research
04/14/2023

CAMM: Building Category-Agnostic and Animatable 3D Models from Monocular Videos

Animating an object in 3D often requires an articulated structure, e.g. ...
research
05/27/2015

PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization

We present a robust and real-time monocular six degree of freedom reloca...
research
08/21/2018

Real Time Elbow Angle Estimation Using Single RGB Camera

The use of motion capture has increased from last decade in a varied spe...

Please sign up or login with your details

Forgot password? Click here to reset