Weakly-Supervised 3D Human Pose Learning via Multi-view Images in the Wild

03/17/2020
by   Umar Iqbal, et al.
15

One major challenge for monocular 3D human pose estimation in-the-wild is the acquisition of training data that contains unconstrained images annotated with accurate 3D poses. In this paper, we address this challenge by proposing a weakly-supervised approach that does not require 3D annotations and learns to estimate 3D poses from unlabeled multi-view data, which can be acquired easily in in-the-wild environments. We propose a novel end-to-end learning framework that enables weakly-supervised training using multi-view consistency. Since multi-view consistency is prone to degenerated solutions, we adopt a 2.5D pose representation and propose a novel objective function that can only be minimized when the predictions of the trained model are consistent and plausible across all camera views. We evaluate our proposed approach on two large scale datasets (Human3.6M and MPII-INF-3DHP) where it achieves state-of-the-art performance among semi-/weakly-supervised methods.

READ FULL TEXT

page 3

page 7

research
05/14/2021

TriPose: A Weakly-Supervised 3D Human Pose Estimation via Triangulation from Video

Estimating 3D human poses from video is a challenging problem. The lack ...
research
09/13/2019

Weakly-Supervised 3D Pose Estimation from a Single Image using Multi-View Consistency

We present a novel data-driven regularizer for weakly-supervised learnin...
research
10/19/2022

Multi-view Tracking Using Weakly Supervised Human Motion Prediction

Multi-view approaches to people-tracking have the potential to better ha...
research
05/23/2021

Weakly-supervised Cross-view 3D Human Pose Estimation

Although monocular 3D human pose estimation methods have made significan...
research
08/18/2019

Distill Knowledge from NRSfM for Weakly Supervised 3D Pose Learning

We propose to learn a 3D pose estimator by distilling knowledge from Non...
research
10/21/2021

Weakly Supervised Training of Monocular 3D Object Detectors Using Wide Baseline Multi-view Traffic Camera Data

Accurate 7DoF prediction of vehicles at an intersection is an important ...
research
03/13/2018

Learning Monocular 3D Human Pose Estimation from Multi-view Images

Accurate 3D human pose estimation from single images is possible with so...

Please sign up or login with your details

Forgot password? Click here to reset