Exemplar Fine-Tuning for 3D Human Pose Fitting Towards In-the-Wild 3D Human Pose Estimation

04/07/2020
by Hanbyul Joo, et al.

We propose a method for building large collections of human poses with full 3D annotations captured "in the wild", where specialized capture equipment cannot be used. We start with a dataset with 2D keypoint annotations, such as COCO or MPII, and generate corresponding 3D poses. This is done via Exemplar Fine-Tuning (EFT), a new method for fitting a 3D parametric model to 2D keypoints. EFT is accurate and can exploit a data-driven pose prior to resolve the depth reconstruction ambiguity that arises when only 2D observations are used as input. We use EFT to augment these large in-the-wild datasets with plausible and accurate 3D pose annotations. We then use this data to strongly supervise a 3D pose regression network, achieving state-of-the-art results on standard benchmarks, including those collected outdoors. This network also achieves unprecedented 3D pose estimation quality on extremely challenging Internet videos.
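
The depth ambiguity mentioned above can be illustrated with a small sketch. This is not the EFT implementation and is not taken from the paper; it assumes a weak-perspective camera of the form s * x + t and a confidence-weighted squared 2D error, and the function names and toy joint values are hypothetical. It shows that two 3D poses that differ only in depth produce the same 2D reprojection error, which is why a fit driven by 2D keypoints alone needs a pose prior.

```python
from typing import List, Sequence, Tuple

Point3D = Tuple[float, float, float]
Point2D = Tuple[float, float]


def project_weak_perspective(
    joints_3d: Sequence[Point3D], scale: float, trans: Point2D
) -> List[Point2D]:
    """Drop the depth coordinate, then apply a uniform scale and a 2D shift.

    Assumed camera convention: u = scale * x + tx, v = scale * y + ty.
    """
    tx, ty = trans
    return [(scale * x + tx, scale * y + ty) for x, y, _z in joints_3d]


def reprojection_loss(
    joints_3d: Sequence[Point3D],
    keypoints_2d: Sequence[Point2D],
    confidences: Sequence[float],
    scale: float,
    trans: Point2D,
) -> float:
    """Confidence-weighted sum of squared 2D distances to the annotations."""
    projected = project_weak_perspective(joints_3d, scale, trans)
    total = 0.0
    for (px, py), (kx, ky), conf in zip(projected, keypoints_2d, confidences):
        total += conf * ((px - kx) ** 2 + (py - ky) ** 2)
    return total


# Toy data (hypothetical): two poses with identical x, y but different depth.
pose_a: List[Point3D] = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.5), (1.0, 1.0, -0.3)]
pose_b: List[Point3D] = [(0.0, 0.0, 2.0), (1.0, 0.0, -1.0), (1.0, 1.0, 4.0)]
keypoints: List[Point2D] = [(0.0, 0.0), (2.0, 0.0), (2.0, 2.0)]
weights = [1.0, 1.0, 0.5]

loss_a = reprojection_loss(pose_a, keypoints, weights, 2.0, (0.0, 0.0))
loss_b = reprojection_loss(pose_b, keypoints, weights, 2.0, (0.0, 0.0))
print(loss_a, loss_b)  # both poses explain the 2D keypoints equally well
```

Because the 2D term cannot distinguish `pose_a` from `pose_b`, some additional source of information about plausible poses is needed to choose between them; the abstract attributes this role to a data-driven pose prior.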


