VIBE: Video Inference for Human Body Pose and Shape Estimation

12/11/2019
by   Muhammed Kocabas, et al.
13

Human motion is fundamental to understanding behavior. Despite progress on single-image 3D pose and shape estimation, existing video-based state-of-the-art methods fail to produce accurate and natural motion sequences due to a lack of ground-truth 3D motion data for training. To address this problem, we propose Video Inference for Body Pose and Shape Estimation (VIBE), which makes use of an existing large-scale motion capture dataset (AMASS) together with unpaired, in-the-wild, 2D keypoint annotations. Our key novelty is an adversarial learning framework that leverages AMASS to discriminate between real human motions and those produced by our temporal pose and shape regression networks. We define a temporal network architecture and show that adversarial training, at the sequence level, produces kinematically plausible motion sequences without in-the-wild ground-truth 3D labels. We perform extensive experimentation to analyze the importance of motion and demonstrate the effectiveness of VIBE on challenging 3D pose estimation datasets, achieving state-of-the-art performance. Code and pretrained models are available at https://github.com/mkocabas/VIBE.

READ FULL TEXT

page 1

page 3

page 8

page 12

research
08/20/2023

Co-Evolution of Pose and Mesh for 3D Human Body Estimation from Video

Despite significant progress in single image-based 3D human mesh recover...
research
05/10/2021

HuMoR: 3D Human Motion Model for Robust Pose Estimation

We introduce HuMoR: a 3D Human Motion Model for Robust Estimation of tem...
research
07/25/2022

3D Shape Sequence of Human Comparison and Classification using Current and Varifolds

In this paper we address the task of the comparison and the classificati...
research
03/16/2022

Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video

Learning to capture human motion is essential to 3D human pose and shape...
research
05/30/2023

Decomposed Human Motion Prior for Video Pose Estimation via Adversarial Training

Estimating human pose from video is a task that receives considerable at...
research
06/21/2022

Domain Adaptive 3D Pose Augmentation for In-the-wild Human Mesh Recovery

The ability to perceive 3D human bodies from a single image has a multit...
research
08/22/2022

PoseBERT: A Generic Transformer Module for Temporal 3D Human Modeling

Training state-of-the-art models for human pose estimation in videos req...

Please sign up or login with your details

Forgot password? Click here to reset