Structured Generative Models for Scene Understanding

This position paper argues for the use of structured generative models (SGMs) for scene understanding. The SGM approach requires reconstructing a 3D scene from an input image, whereby the contents of the image are causally explained in terms of models of instantiated objects, each with its own type, shape, appearance and pose, along with global variables such as scene lighting and camera parameters. It also requires scene models that account for the co-occurrences and inter-relationships of objects in a scene. The SGM approach has the merits of being compositional and generative, which lead to interpretability. To pursue the SGM agenda, we need models for objects and scenes, and approaches to carry out inference. We first review models for objects, which include “things” (object categories that have a well-defined shape) and “stuff” (categories that have amorphous spatial extent). We then review scene models, which describe the inter-relationships of objects. Perhaps the most challenging problem for SGMs is inferring the objects, lighting and camera parameters, and scene inter-relationships from input consisting of one or more images. We conclude with a discussion of issues that need addressing to advance the SGM agenda.
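
As an illustration only, the kind of scene description outlined in the abstract could be sketched as a small data structure. The class and field names here (`ObjectInstance`, `Relationship`, `Scene`, the latent-code lists, the six-value pose, the single focal-length camera parameter) are assumptions made for this sketch and do not come from the paper.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class ObjectInstance:
    """One instantiated object; all field layouts are hypothetical."""
    category: str             # object type, e.g. "table" (a "thing") or "grass" ("stuff")
    shape: List[float]        # assumed latent shape code
    appearance: List[float]   # assumed latent appearance code
    pose: Tuple[float, float, float, float, float, float]  # assumed x, y, z, roll, pitch, yaw


@dataclass
class Relationship:
    """A pairwise inter-relationship between two objects, referenced by index."""
    subject: int
    relation: str             # e.g. "on_top_of" (hypothetical vocabulary)
    target: int


@dataclass
class Scene:
    """Objects, their inter-relationships, and global scene variables."""
    objects: List[ObjectInstance] = field(default_factory=list)
    relationships: List[Relationship] = field(default_factory=list)
    lighting: Tuple[float, float, float] = (0.0, 0.0, 1.0)  # assumed light direction
    camera_focal_length: float = 1.0                        # assumed single camera parameter


def describe(scene: Scene) -> List[str]:
    """Return one human-readable statement per relationship in the scene."""
    return [
        f"{scene.objects[r.subject].category} {r.relation} {scene.objects[r.target].category}"
        for r in scene.relationships
    ]


# A toy scene: a mug resting on a table.
table = ObjectInstance("table", [0.1], [0.5], (0.0, 0.0, 0.0, 0.0, 0.0, 0.0))
mug = ObjectInstance("mug", [0.3], [0.2], (0.0, 0.0, 0.8, 0.0, 0.0, 0.0))
scene = Scene(objects=[table, mug], relationships=[Relationship(1, "on_top_of", 0)])
print(describe(scene))
```

In this view, inference would amount to recovering such a `Scene` from one or more images. The sketch does not attempt that step. It only shows why an explicit, compositional description is easy to inspect.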
