Exploiting temporal context for 3D human pose estimation in the wild

05/10/2019
by   Anurag Arnab, et al.
8

We present a bundle-adjustment-based algorithm for recovering accurate 3D human pose and meshes from monocular videos. Unlike previous algorithms which operate on single frames, we show that reconstructing a person over an entire sequence gives extra constraints that can resolve ambiguities. This is because videos often give multiple views of a person, yet the overall body shape does not change and 3D positions vary slowly. Our method improves not only on standard mocap-based datasets like Human 3.6M -- where we show quantitative improvements -- but also on challenging in-the-wild datasets such as Kinetics. Building upon our algorithm, we present a new dataset of more than 3 million frames of YouTube videos from Kinetics with automatically generated 3D poses and meshes. We show that retraining a single-frame 3D pose estimator on this data improves accuracy on both real-world and mocap data by evaluating on the 3DPW and HumanEVA datasets.

READ FULL TEXT

page 2

page 4

page 5

page 8

page 13

research
10/16/2020

Towards Accurate Human Pose Estimation in Videos of Crowded Scenes

Video-based human pose estimation in crowded scenes is a challenging pro...
research
04/15/2021

3DCrowdNet: 2D Human Pose-Guided3D Crowd Human Pose and Shape Estimation in the Wild

Recovering accurate 3D human pose and shape from in-the-wild crowd scene...
research
04/07/2020

Exemplar Fine-Tuning for 3D Human Pose Fitting Towards In-the-Wild 3D Human Pose Estimation

We propose a method for building large collections of human poses with f...
research
06/01/2019

Temporally Coherent Full 3D Mesh Human Pose Recovery from Monocular Video

Advances in Deep Learning have recently made it possible to recover full...
research
09/15/2023

Towards Robust and Smooth 3D Multi-Person Pose Estimation from Monocular Videos in the Wild

3D pose estimation is an invaluable task in computer vision with various...
research
11/23/2020

NeuralAnnot: Neural Annotator for in-the-wild Expressive 3D Human Pose and Mesh Training Sets

Recovering expressive 3D human pose and mesh from in-the-wild images is ...
research
12/16/2018

Human Pose and Path Estimation from Aerial Video using Dynamic Classifier Selection

We consider the problem of estimating human pose and trajectory by an ae...

Please sign up or login with your details

Forgot password? Click here to reset