Pose2Room: Understanding 3D Scenes from Human Activities

12/01/2021
by   Yinyu Nie, et al.
0

With wearable IMU sensors, one can estimate human poses from wearable devices without requiring visual input <cit.>. In this work, we pose the question: Can we reason about object structure in real-world environments solely from human trajectory information? Crucially, we observe that human motion and interactions tend to give strong information about the objects in a scene – for instance a person sitting indicates the likely presence of a chair or sofa. To this end, we propose P2R-Net to learn a probabilistic 3D model of the objects in a scene characterized by their class categories and oriented 3D bounding boxes, based on an input observed human trajectory in the environment. P2R-Net models the probability distribution of object class as well as a deep Gaussian mixture model for object boxes, enabling sampling of multiple, diverse, likely modes of object configurations from an observed human trajectory. In our experiments we demonstrate that P2R-Net can effectively learn multi-modal distributions of likely objects for human motions, and produce a variety of plausible object structures of the environment, even without any visual information.

READ FULL TEXT

page 3

page 6

page 7

page 15

page 17

page 18

page 19

page 20

research
12/08/2022

MIME: Human-Aware 3D Scene Generation

Generating realistic 3D worlds occupied by moving humans has many applic...
research
06/27/2012

Learning Object Arrangements in 3D Scenes using Human Context

We consider the problem of learning object arrangements in a 3D scene. T...
research
11/24/2018

What and Where: A Context-based Recommendation System for Object Insertion

In this work, we propose a novel topic consisting of two dual tasks: 1) ...
research
03/06/2021

Indoor Future Person Localization from an Egocentric Wearable Camera

Accurate prediction of future person location and movement trajectory fr...
research
10/12/2019

MultiPath: Multiple Probabilistic Anchor Trajectory Hypotheses for Behavior Prediction

Predicting human behavior is a difficult and crucial task required for m...
research
11/25/2022

Learning 3D Scene Priors with 2D Supervision

Holistic 3D scene understanding entails estimation of both layout config...
research
01/16/2020

Contextual Sense Making by Fusing Scene Classification, Detections, and Events in Full Motion Video

With the proliferation of imaging sensors, the volume of multi-modal ima...

Please sign up or login with your details

Forgot password? Click here to reset