3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera

10/06/2019
by Iro Armeni, et al.

A comprehensive semantic understanding of a scene is important for many applications, but in what space should diverse semantic information (e.g., objects, scene categories, material types, and texture) be grounded, and what should its structure be? Aspiring to one unified structure that hosts diverse types of semantics, we follow the Scene Graph paradigm in 3D and generate a 3D Scene Graph. Given a 3D mesh and registered panoramic images, we construct a graph that spans the entire building and includes semantics on objects (e.g., class, material, and other attributes), rooms (e.g., scene category and volume), and cameras (e.g., location), as well as the relationships among these entities. However, this process is prohibitively labor-intensive if done manually. To alleviate this, we devise a semi-automatic framework that employs existing detection methods and enhances them using two main constraints: (i) framing of query images sampled on panoramas to maximize the performance of 2D detectors, and (ii) multi-view consistency enforcement across 2D detections that originate from different camera locations.
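To make the structure described in the abstract more concrete, here is a minimal Python sketch of a graph that holds building, room, object, and camera nodes together with labeled relationships between them. It is an illustration only, not the authors' implementation: the class names (`Node`, `SceneGraph3D`), the relation labels (`part_of`, `in_room`, `sees`), and all attribute values are assumptions chosen for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Node:
    """One entity in the graph (hypothetical schema, not from the paper)."""
    node_id: str
    kind: str  # "building", "room", "object", or "camera"
    attributes: Dict[str, object] = field(default_factory=dict)


@dataclass
class SceneGraph3D:
    """Nodes keyed by id, plus directed labeled edges (source, relation, target)."""
    nodes: Dict[str, Node] = field(default_factory=dict)
    edges: List[Tuple[str, str, str]] = field(default_factory=list)

    def add_node(self, node_id: str, kind: str, **attributes) -> Node:
        node = Node(node_id, kind, dict(attributes))
        self.nodes[node_id] = node
        return node

    def add_edge(self, source: str, relation: str, target: str) -> None:
        # Refuse edges whose endpoints are unknown, so the graph stays consistent.
        if source not in self.nodes or target not in self.nodes:
            raise KeyError("both endpoints must be added before linking them")
        self.edges.append((source, relation, target))

    def neighbors(self, source: str, relation: str) -> List[str]:
        # Targets reachable from `source` through edges labeled `relation`.
        return [t for s, r, t in self.edges if s == source and r == relation]


# Illustrative values only; none of these come from the paper's data.
graph = SceneGraph3D()
graph.add_node("building_0", "building")
graph.add_node("room_0", "room", scene_category="kitchen", volume_m3=30.0)
graph.add_node("object_0", "object", object_class="chair", material="wood")
graph.add_node("camera_0", "camera", location=(1.0, 2.0, 1.5))

graph.add_edge("room_0", "part_of", "building_0")
graph.add_edge("object_0", "in_room", "room_0")
graph.add_edge("camera_0", "in_room", "room_0")
graph.add_edge("camera_0", "sees", "object_0")
```

In this sketch, a query such as `graph.neighbors("camera_0", "sees")` returns the ids of the objects linked to that camera. Keeping attributes in a free-form dictionary is one simple way to host diverse types of semantics in one structure; a typed schema per node kind would be an equally reasonable choice.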


