MUG: Multi-human Graph Network for 3D Mesh Reconstruction from 2D Pose

05/25/2022
by   Chenyan Wu, et al.
0

Reconstructing multi-human body mesh from a single monocular image is an important but challenging computer vision problem. In addition to the individual body mesh models, we need to estimate relative 3D positions among subjects to generate a coherent representation. In this work, through a single graph neural network, named MUG (Multi-hUman Graph network), we construct coherent multi-human meshes using only multi-human 2D pose as input. Compared with existing methods, which adopt a detection-style pipeline (i.e., extracting image features and then locating human instances and recovering body meshes from that) and suffer from the significant domain gap between lab-collected training datasets and in-the-wild testing datasets, our method benefits from the 2D pose which has a relatively consistent geometric property across datasets. Our method works like the following: First, to model the multi-human environment, it processes multi-human 2D poses and builds a novel heterogeneous graph, where nodes from different people and within one person are connected to capture inter-human interactions and draw the body geometry (i.e., skeleton and mesh structure). Second, it employs a dual-branch graph neural network structure – one for predicting inter-human depth relation and the other one for predicting root-joint-relative mesh coordinates. Finally, the entire multi-human 3D meshes are constructed by combining the output from both branches. Extensive experiments demonstrate that MUG outperforms previous multi-human mesh estimation methods on standard 3D human benchmarks – Panoptic, MuPoTS-3D and 3DPW.

READ FULL TEXT
research
08/20/2020

Pose2Mesh: Graph Convolutional Network for 3D Human Pose and Mesh Recovery from a 2D Human Pose

Most of the recent deep learning-based 3D human pose and mesh estimation...
research
06/01/2019

Temporally Coherent Full 3D Mesh Human Pose Recovery from Monocular Video

Advances in Deep Learning have recently made it possible to recover full...
research
10/24/2022

Multi-Person 3D Pose and Shape Estimation via Inverse Kinematics and Refinement

Estimating 3D poses and shapes in the form of meshes from monocular RGB ...
research
10/27/2020

Synthetic Training for Monocular Human Mesh Recovery

Recovering 3D human mesh from monocular images is a popular topic in com...
research
06/23/2023

A Graph Neural Network Approach for Temporal Mesh Blending and Correspondence

We have proposed a self-supervised deep learning framework for solving t...
research
05/28/2019

Cerberus: A Multi-headed Derenderer

To generalize to novel visual scenes with new viewpoints and new object ...
research
08/23/2023

Pose Modulated Avatars from Video

It is now possible to reconstruct dynamic human motion and shape from a ...

Please sign up or login with your details

Forgot password? Click here to reset