LiftFormer: 3D Human Pose Estimation using attention models

09/01/2020
by Adrian Llopart, et al.

Estimating the 3D position of human joints has become a widely researched topic in recent years. Special emphasis has gone into defining novel methods that extrapolate 2-dimensional data (keypoints) into 3D, namely predicting the root-relative coordinates of joints associated with human skeletons. Recent research has shown that Transformer Encoder blocks aggregate temporal information significantly better than previous approaches. We therefore propose using these models to obtain more accurate 3D predictions by leveraging temporal information through attention mechanisms applied to ordered sequences of human poses in videos. Our method consistently outperforms the previous best results from the literature on Human3.6M, both when using 2D keypoint predictors, by 0.3 mm (44.8 MPJPE, 0.7% improvement), and when using ground truth inputs, by 2 mm (31.9 MPJPE, 8.4% improvement). It also achieves state-of-the-art performance on the HumanEva-I dataset with 10.5 P-MPJPE (22.2% reduction). The model is easily tunable and is smaller (9.5M parameters) than current methodologies (16.95M and 11.25M) whilst still having better performance. Thus, our 3D lifting model's accuracy exceeds that of other end-to-end or SMPL approaches and is comparable to many multi-view methods.
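
The abstract reports errors as MPJPE (mean per-joint position error) on root-relative joint coordinates. As a rough illustration only, the sketch below shows how such a metric might be computed for a single pose. The function names, the choice of joint 0 as the root, and the toy 3-joint skeleton are assumptions made for this example and are not taken from the paper.

```python
import math

def root_relative(pose, root_index=0):
    """Express every joint relative to the root joint (assumed here to be joint 0)."""
    rx, ry, rz = pose[root_index]
    return [(x - rx, y - ry, z - rz) for x, y, z in pose]

def mpjpe(predicted, target):
    """Mean Euclidean distance between corresponding joints, in the input units."""
    if len(predicted) != len(target) or not predicted:
        raise ValueError("poses must be non-empty and have the same number of joints")
    return sum(math.dist(p, t) for p, t in zip(predicted, target)) / len(predicted)

# Toy 3-joint skeleton in millimetres (made-up values, not from any dataset).
target = [(100.0, 200.0, 3000.0), (100.0, 500.0, 3000.0), (400.0, 200.0, 3000.0)]
predicted = [(100.0, 200.0, 3000.0), (103.0, 504.0, 3000.0), (400.0, 200.0, 3012.0)]

error_mm = mpjpe(root_relative(predicted), root_relative(target))
print(f"MPJPE: {error_mm:.2f} mm")  # prints "MPJPE: 5.67 mm"
```

In practice the metric is averaged over all frames of a test set. P-MPJPE additionally aligns the prediction to the target with a rigid (Procrustes) transformation before measuring the error, which this sketch does not do.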

research
11/21/2019

Consensus-based Optimization for 3D Human Pose Estimation in Camera Coordinates

3D human pose estimation is frequently seen as the task of estimating 3D...
research
05/23/2019

Pose estimator and tracker using temporal flow maps for limbs

For human pose estimation in videos, it is significant how to use tempor...
research
10/24/2022

Video based Object 6D Pose Estimation using Transformers

We introduce a Transformer based 6D Object Pose Estimation framework Vid...
research
05/09/2021

Estimation of 3D Human Pose Using Prior Knowledge

Estimating three-dimensional human poses from the positions of two-dimen...
research
07/29/2021

Improving Robustness and Accuracy via Relative Information Encoding in 3D Human Pose Estimation

Most of the existing 3D human pose estimation approaches mainly focus on...
research
12/06/2022

DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model

Thanks to the development of 2D keypoint detectors, monocular 3D human p...
research
05/10/2018

Dealing with sequences in the RGBDT space

Most of the current research in computer vision is focused on working wi...
