DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

03/16/2016
by   Yinda Zhang, et al.
0

While deep neural networks have led to human-level performance on computer vision tasks, they have yet to demonstrate similar gains for holistic scene understanding. In particular, 3D context has been shown to be an extremely important cue for scene understanding - yet very little research has been done on integrating context information with deep models. This paper presents an approach to embed 3D context into the topology of a neural network trained to perform holistic scene understanding. Given a depth image depicting a 3D scene, our network aligns the observed scene with a predefined 3D scene template, and then reasons about the existence and location of each object within the scene template. In doing so, our model recognizes multiple objects in a single forward pass of a 3D convolutional neural network, capturing both global scene and local object information simultaneously. To create training data for this 3D network, we generate partly hallucinated depth images which are rendered by replacing real objects with a repository of CAD models of the same object category. Extensive experiments demonstrate the effectiveness of our algorithm compared to the state-of-the-arts. Source code and data are available at http://deepcontext.cs.princeton.edu.

READ FULL TEXT

page 1

page 2

page 4

page 5

page 6

page 8

research
03/11/2021

Holistic 3D Scene Understanding from a Single Image with Implicit Representation

We present a new pipeline for holistic 3D scene understanding from a sin...
research
06/16/2014

Human-Machine CRFs for Identifying Bottlenecks in Holistic Scene Understanding

Recent trends in image understanding have pushed for holistic scene unde...
research
11/09/2022

Understanding Cross-modal Interactions in V L Models that Generate Scene Descriptions

Image captioning models tend to describe images in an object-centric way...
research
02/20/2022

3DRM:Pair-wise relation module for 3D object detection

Context has proven to be one of the most important factors in object lay...
research
08/15/2022

HoW-3D: Holistic 3D Wireframe Perception from a Single Image

This paper studies the problem of holistic 3D wireframe perception (HoW-...
research
11/24/2015

Searching for Objects using Structure in Indoor Scenes

To identify the location of objects of a particular class, a passive com...
research
06/02/2023

Towards In-context Scene Understanding

In-context learningx2013the ability to configure a model's behavior with...

Please sign up or login with your details

Forgot password? Click here to reset