Populating 3D Scenes by Learning Human-Scene Interaction

12/21/2020
by   Mohamed Hassan, et al.
8

Humans live within a 3D space and constantly interact with it to perform tasks. Such interactions involve physical contact between surfaces that is semantically meaningful. Our goal is to learn how humans interact with scenes and leverage this to enable virtual characters to do the same. To that end, we introduce a novel Human-Scene Interaction (HSI) model that encodes proximal relationships, called POSA for "Pose with prOximitieS and contActs". The representation of interaction is body-centric, which enables it to generalize to new scenes. Specifically, POSA augments the SMPL-X parametric human body model such that, for every mesh vertex, it encodes (a) the contact probability with the scene surface and (b) the corresponding semantic scene label. We learn POSA with a VAE conditioned on the SMPL-X vertices, and train on the PROX dataset, which contains SMPL-X meshes of people interacting with 3D scenes, and the corresponding scene semantics from the PROX-E dataset. We demonstrate the value of POSA with two applications. First, we automatically place 3D scans of people in scenes. We use a SMPL-X model fit to the scan as a proxy and then find its most likely placement in 3D. POSA provides an effective representation to search for "affordances" in the scene that match the likely contact relationships for that pose. We perform a perceptual study that shows significant improvement over the state of the art on this task. Second, we show that POSA's learned representation of body-scene interaction supports monocular human pose estimation that is consistent with a 3D scene, improving on the state of the art. Our model and code will be available for research purposes at https://posa.is.tue.mpg.de.

READ FULL TEXT

page 1

page 5

page 6

page 13

page 14

research
08/20/2019

Resolving 3D Human Pose Ambiguities with 3D Scene Constraints

To understand and analyze human behavior, we need to capture humans movi...
research
08/12/2020

Generating Person-Scene Interactions in 3D Scenes

High fidelity digital 3D environments have been proposed in recent years...
research
12/05/2019

Generating 3D People in Scenes without People

We present a fully-automatic system that takes a 3D scene and generates ...
research
04/09/2018

Binge Watching: Scaling Affordance Learning from Sitcoms

In recent years, there has been a renewed interest in jointly modeling p...
research
03/27/2023

Hi4D: 4D Instance Segmentation of Close Human Interaction

We propose Hi4D, a method and dataset for the automatic analysis of phys...
research
03/06/2023

Detecting Human-Object Contact in Images

Humans constantly contact objects to move and perform tasks. Thus, detec...
research
03/31/2021

Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-localization in Large Scenes from Body-Mounted Sensors

We introduce (HPS) Human POSEitioning System, a method to recover the fu...

Please sign up or login with your details

Forgot password? Click here to reset