Visual Graphs from Motion (VGfM): Scene understanding with object geometry reasoning

07/16/2018
by   Paul Gay, et al.
0

Recent approaches on visual scene understanding attempt to build a scene graph -- a computational representation of objects and their pairwise relationships. Such rich semantic representation is very appealing, yet difficult to obtain from a single image, especially when considering complex spatial arrangements in the scene. Differently, an image sequence conveys useful information using the multi-view geometric relations arising from camera motion. Indeed, in such cases, object relationships are naturally related to the 3D scene structure. To this end, this paper proposes a system that first computes the geometrical location of objects in a generic scene and then efficiently constructs scene graphs from video by embedding such geometrical reasoning. Such compelling representation is obtained using a new model where geometric and visual features are merged using an RNN framework. We report results on a dataset we created for the task of 3D scene graph generation in multiple views.

READ FULL TEXT

page 5

page 14

research
11/30/2022

Iterative Scene Graph Generation with Generative Transformers

Scene graphs provide a rich, structured representation of a scene by enc...
research
05/12/2021

Image interpretation by iterative bottom-up top-down processing

Scene understanding requires the extraction and representation of scene ...
research
09/07/2022

VGStore: A Multimodal Extension to SPARQL for Querying RDF Scene Graph

Semantic Web technology has successfully facilitated many RDF models wit...
research
12/08/2022

Latent Graph Representations for Critical View of Safety Assessment

Assessing the critical view of safety in laparoscopic cholecystectomy re...
research
09/26/2022

Totems: Physical Objects for Verifying Visual Integrity

We introduce a new approach to image forensics: placing physical refract...
research
12/05/2018

Explainable and Explicit Visual Reasoning over Scene Graphs

We aim to dismantle the prevalent black-box neural architectures used in...
research
10/06/2019

3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera

A comprehensive semantic understanding of a scene is important for many ...

Please sign up or login with your details

Forgot password? Click here to reset