MPT: Mesh Pre-Training with Transformers for Human Pose and Mesh Reconstruction

11/24/2022
by   Kevin Lin, et al.
0

We present Mesh Pre-Training (MPT), a new pre-training framework that leverages 3D mesh data such as MoCap data for human pose and mesh reconstruction from a single image. Existing work in 3D pose and mesh reconstruction typically requires image-mesh pairs as the training data, but the acquisition of 2D-to-3D annotations is difficult. In this paper, we explore how to leverage 3D mesh data such as MoCap data, that does not have RGB images, for pre-training. The key idea is that even though 3D mesh data cannot be used for end-to-end training due to a lack of the corresponding RGB images, it can be used to pre-train the mesh regression transformer subnetwork. We observe that such pre-training not only improves the accuracy of mesh reconstruction from a single image, but also enables zero-shot capability. We conduct mesh pre-training using 2 million meshes. Experimental results show that MPT advances the state-of-the-art results on Human3.6M and 3DPW datasets. We also show that MPT enables transformer models to have zero-shot capability of human mesh reconstruction from real images. In addition, we demonstrate the generalizability of MPT to 3D hand reconstruction, achieving state-of-the-art results on FreiHAND dataset.

READ FULL TEXT

page 5

page 7

page 8

page 12

research
12/17/2020

End-to-End Human Pose and Mesh Reconstruction with Transformers

We present a new method, called MEsh TRansfOrmer (METRO), to reconstruct...
research
04/01/2021

Mesh Graphormer

We present a graph-convolution-reinforced transformer, named Mesh Grapho...
research
12/29/2018

Skeleton Transformer Networks: 3D Human Pose and Skinned Mesh from Single RGB Image

In this paper, we present Skeleton Transformer Networks (SkeletonNet), a...
research
07/27/2022

Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers

Transformer encoder architectures have recently achieved state-of-the-ar...
research
11/07/2021

Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online Adaptation

We consider a new problem of adapting a human mesh reconstruction model ...
research
02/15/2022

MeshLeTemp: Leveraging the Learnable Vertex-Vertex Relationship to Generalize Human Pose and Mesh Reconstruction for In-the-Wild Scenes

We present MeshLeTemp, a powerful method for 3D human pose and mesh reco...
research
11/18/2019

Towards Robust RGB-D Human Mesh Recovery

We consider the problem of human pose estimation. While much recent work...

Please sign up or login with your details

Forgot password? Click here to reset