Scenarios: A New Representation for Complex Scene Understanding

02/16/2018
by   Zachary A. Daniels, et al.
0

The ability for computational agents to reason about the high-level content of real world scene images is important for many applications. Existing attempts at addressing the problem of complex scene understanding lack representational power, efficiency, and the ability to create robust meta-knowledge about scenes. In this paper, we introduce scenarios as a new way of representing scenes. The scenario is a simple, low-dimensional, data-driven representation consisting of sets of frequently co-occurring objects and is useful for a wide range of scene understanding tasks. We learn scenarios from data using a novel matrix factorization method which we integrate into a new neural network architecture, the ScenarioNet. Using ScenarioNet, we can recover semantic information about real world scene images at three levels of granularity: 1) scene categories, 2) scenarios, and 3) objects. Training a single ScenarioNet model enables us to perform scene classification, scenario recognition, multi-object recognition, content-based scene image retrieval, and content-based image comparison. In addition to solving many tasks in a single, unified framework, ScenarioNet is more computationally efficient than other CNNs because it requires significantly fewer parameters while achieving similar performance on benchmark tasks and is more interpretable because it produces explanations when making decisions. We validate the utility of scenarios and ScenarioNet on a diverse set of scene understanding tasks on several benchmark datasets.

READ FULL TEXT
research
04/05/2016

Radiometric Scene Decomposition: Scene Reflectance, Illumination, and Geometry from RGB-D Images

Recovering the radiometric properties of a scene (i.e., the reflectance,...
research
12/02/2021

Recognizing Scenes from Novel Viewpoints

Humans can perceive scenes in 3D from a handful of 2D views. For AI agen...
research
06/09/2022

Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields

Comprehensive 3D scene understanding, both geometrically and semanticall...
research
04/17/2023

RS2G: Data-Driven Scene-Graph Extraction and Embedding for Robust Autonomous Perception and Scenario Understanding

Human drivers naturally reason about interactions between road users to ...
research
07/16/2022

Scene Graph for Embodied Exploration in Cluttered Scenario

The ability to handle objects in cluttered environment has been long ant...
research
08/27/2018

Single Shot Scene Text Retrieval

Textual information found in scene images provides high level semantic i...
research
04/22/2021

Aerial Scene Understanding in The Wild: Multi-Scene Recognition via Prototype-based Memory Networks

Aerial scene recognition is a fundamental visual task and has attracted ...

Please sign up or login with your details

Forgot password? Click here to reset