SGRAM: Improving Scene Graph Parsing via Abstract Meaning Representation

10/17/2022
by   Woo Suk Choi, et al.
0

Scene graph is structured semantic representation that can be modeled as a form of graph from images and texts. Image-based scene graph generation research has been actively conducted until recently, whereas text-based scene graph generation research has not. In this paper, we focus on the problem of scene graph parsing from textual description of a visual scene. The core idea is to use abstract meaning representation (AMR) instead of the dependency parsing mainly used in previous studies. AMR is a graph-based semantic formalism of natural language which abstracts concepts of words in a sentence contrary to the dependency parsing which considers dependency relationships on all words in a sentence. To this end, we design a simple yet effective two-stage scene graph parsing framework utilizing abstract meaning representation, SGRAM (Scene GRaph parsing via Abstract Meaning representation): 1) transforming a textual description of an image into an AMR graph (Text-to-AMR) and 2) encoding the AMR graph into a Transformer-based language model to generate a scene graph (AMR-to-SG). Experimental results show the scene graphs generated by our framework outperforms the dependency parsing-based model by 11.61% and the previous state-of-the-art model using a pre-trained Transformer language model by 3.78%. Furthermore, we apply SGRAM to image retrieval task which is one of downstream tasks for scene graph, and confirm the effectiveness of scene graphs generated by our framework.

READ FULL TEXT

page 3

page 6

research
03/25/2018

Scene Graph Parsing as Dependency Parsing

In this paper, we study the problem of parsing structured knowledge grap...
research
10/26/2022

Visual Semantic Parsing: From Images to Abstract Meaning Representation

The success of scene graphs for visual scene understanding has brought a...
research
03/05/2021

Parsing Indonesian Sentence into Abstract Meaning Representation using Machine Learning Approach

Abstract Meaning Representation (AMR) provides many information of a sen...
research
11/16/2017

ConvAMR: Abstract meaning representation parsing for legal document

Convolutional neural networks (CNN) have recently achieved remarkable pe...
research
05/05/2021

Polynomial Graph Parsing with Non-Structural Reentrancies

Graph-based semantic representations are valuable in natural language pr...
research
06/15/2022

A Survey : Neural Networks for AMR-to-Text

AMR-to-text is one of the key techniques in the NLP community that aims ...
research
04/02/2023

Learning Similarity between Scene Graphs and Images with Transformers

Scene graph generation is conventionally evaluated by (mean) Recall@K, w...

Please sign up or login with your details

Forgot password? Click here to reset