TriPose: A Weakly-Supervised 3D Human Pose Estimation via Triangulation from Video

05/14/2021
by   Mohsen Gholami, et al.
0

Estimating 3D human poses from video is a challenging problem. The lack of 3D human pose annotations is a major obstacle for supervised training and for generalization to unseen datasets. In this work, we address this problem by proposing a weakly-supervised training scheme that does not require 3D annotations or calibrated cameras. The proposed method relies on temporal information and triangulation. Using 2D poses from multiple views as the input, we first estimate the relative camera orientations and then generate 3D poses via triangulation. The triangulation is only applied to the views with high 2D human joint confidence. The generated 3D poses are then used to train a recurrent lifting network (RLN) that estimates 3D poses from 2D poses. We further apply a multi-view re-projection loss to the estimated 3D poses and enforce the 3D poses estimated from multi-views to be consistent. Therefore, our method relaxes the constraints in practice, only multi-view videos are required for training, and is thus convenient for in-the-wild settings. At inference, RLN merely requires single-view videos. The proposed method outperforms previous works on two challenging datasets, Human3.6M and MPI-INF-3DHP. Codes and pretrained models will be publicly available.

READ FULL TEXT

page 1

page 7

page 8

research
03/17/2020

Weakly-Supervised 3D Human Pose Learning via Multi-view Images in the Wild

One major challenge for monocular 3D human pose estimation in-the-wild i...
research
06/10/2021

SVMA: A GAN-based model for Monocular 3D Human Pose Estimation

Recovering 3D human pose from 2D joints is a highly unconstrained proble...
research
01/08/2023

CameraPose: Weakly-Supervised Monocular 3D Human Pose Estimation by Leveraging In-the-wild 2D Annotations

To improve the generalization of 3D human pose estimators, many existing...
research
10/21/2022

3D Human Pose Estimation in Multi-View Operating Room Videos Using Differentiable Camera Projections

3D human pose estimation in multi-view operating room (OR) videos is a r...
research
06/13/2020

Dynamic gesture retrieval: searching videos by human pose sequence

The number of static human poses is limited, it is hard to retrieve the ...
research
11/28/2018

3D human pose estimation in video with temporal convolutions and semi-supervised training

In this work, we demonstrate that 3D poses in video can be effectively e...
research
03/29/2022

On Triangulation as a Form of Self-Supervision for 3D Human Pose Estimation

Supervised approaches to 3D pose estimation from single images are remar...

Please sign up or login with your details

Forgot password? Click here to reset