Scene Graph Modification Based on Natural Language Commands

10/06/2020
by   Xuanli He, et al.
0

Structured representations like graphs and parse trees play a crucial role in many Natural Language Processing systems. In recent years, the advancements in multi-turn user interfaces necessitate the need for controlling and updating these structured representations given new sources of information. Although there have been many efforts focusing on improving the performance of the parsers that map text to graphs or parse trees, very few have explored the problem of directly manipulating these representations. In this paper, we explore the novel problem of graph modification, where the systems need to learn how to update an existing scene graph given a new user's command. Our novel models based on graph-based sparse transformer and cross attention information fusion outperform previous systems adapted from the machine translation and graph generation literature. We further contribute our large graph modification datasets to the research community to encourage future research for this new problem.

READ FULL TEXT

page 14

page 18

page 19

research
09/15/2022

Scene Graph Modification as Incremental Structure Expanding

A scene graph is a semantic representation that expresses the objects, a...
research
09/13/2019

Scene Graph Parsing by Attention Graph

Scene graph representations, which form a graph of visual object nodes t...
research
10/18/2021

Deep Transfer Learning Beyond: Transformer Language Models in Information Systems Research

AI is widely thought to be poised to transform business, yet current per...
research
09/15/2018

Improving Natural Language Inference Using External Knowledge in the Science Questions Domain

Natural Language Inference (NLI) is fundamental to many Natural Language...
research
07/07/2023

Open-Vocabulary Object Detection via Scene Graph Discovery

In recent years, open-vocabulary (OV) object detection has attracted inc...
research
06/20/2023

Transforming Graphs for Enhanced Attribute-Based Clustering: An Innovative Graph Transformer Method

Graph Representation Learning (GRL) is an influential methodology, enabl...
research
03/12/2020

Learning distributed representations of graphs with Geo2DR

We present Geo2DR, a Python library for unsupervised learning on graph-s...

Please sign up or login with your details

Forgot password? Click here to reset