Scene Graph Parsing by Attention Graph

09/13/2019
by   Martin Andrews, et al.
0

Scene graph representations, which form a graph of visual object nodes together with their attributes and relations, have proved useful across a variety of vision and language applications. Recent work in the area has used Natural Language Processing dependency tree methods to automatically build scene graphs. In this work, we present an 'Attention Graph' mechanism that can be trained end-to-end, and produces a scene graph structure that can be lifted directly from the top layer of a standard Transformer model. The scene graphs generated by our model achieve an F-score similarity of 52.21 surpassing the best previous approaches by 2.5

READ FULL TEXT
research
03/25/2018

Scene Graph Parsing as Dependency Parsing

In this paper, we study the problem of parsing structured knowledge grap...
research
11/17/2017

Neural Motifs: Scene Graph Parsing with Global Context

We investigate the problem of producing structured graph representations...
research
10/06/2020

Scene Graph Modification Based on Natural Language Commands

Structured representations like graphs and parse trees play a crucial ro...
research
09/17/2023

Optimal Scene Graph Planning with Large Language Model Guidance

Recent advances in metric, semantic, and topological mapping have equipp...
research
10/26/2022

Visual Semantic Parsing: From Images to Abstract Meaning Representation

The success of scene graphs for visual scene understanding has brought a...
research
03/27/2022

Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language Structures via Dependency Relationships

Understanding realistic visual scene images together with language descr...
research
04/26/2023

Scene Graph Lossless Compression with Adaptive Prediction for Objects and Relations

The scene graph is a new data structure describing objects and their pai...

Please sign up or login with your details

Forgot password? Click here to reset