Coherent Reconstruction of Multiple Humans from a Single Image

06/15/2020
by   Wen Jiang, et al.
4

In this work, we address the problem of multi-person 3D pose estimation from a single image. A typical regression approach in the top-down setting of this problem would first detect all humans and then reconstruct each one of them independently. However, this type of prediction suffers from incoherent results, e.g., interpenetration and inconsistent depth ordering between the people in the scene. Our goal is to train a single network that learns to avoid these problems and generate a coherent 3D reconstruction of all the humans in the scene. To this end, a key design choice is the incorporation of the SMPL parametric body model in our top-down framework, which enables the use of two novel losses. First, a distance field-based collision loss penalizes interpenetration among the reconstructed people. Second, a depth ordering-aware loss reasons about occlusions and promotes a depth ordering of people that leads to a rendering which is consistent with the annotated instance segmentation. This provides depth supervision signals to the network, even if the image has no explicit 3D annotations. The experiments show that our approach outperforms previous methods on standard 3D pose benchmarks, while our proposed losses enable more coherent reconstruction in natural images. The project website with videos, results, and code can be found at: https://jiangwenpl.github.io/multiperson

READ FULL TEXT

page 2

page 5

page 8

research
11/02/2021

Body Size and Depth Disambiguation in Multi-Person Reconstruction from Single Images

We address the problem of multi-person 3D body pose and shape estimation...
research
04/19/2021

Multi-person Implicit Reconstruction from a Single Image

We present a new end-to-end learning framework to obtain detailed and sp...
research
09/29/2020

Multi-View Consistency Loss for Improved Single-Image 3D Reconstruction of Clothed People

We present a novel method to improve the accuracy of the 3D reconstructi...
research
07/23/2019

U4D: Unsupervised 4D Dynamic Scene Understanding

We introduce the first approach to solve the challenging problem of unsu...
research
12/15/2021

Putting People in their Place: Monocular Regression of 3D People in Depth

Given an image with multiple people, our goal is to directly regress the...
research
05/12/2015

Monocular Object Instance Segmentation and Depth Ordering with CNNs

In this paper we tackle the problem of instance-level segmentation and d...
research
07/15/2021

Single-image Full-body Human Relighting

We present a single-image data-driven method to automatically relight im...

Please sign up or login with your details

Forgot password? Click here to reset