Language-Conditioned Graph Networks for Relational Reasoning

05/10/2019
by   Ronghang Hu, et al.
6

Solving grounded language tasks often requires reasoning about relationships between objects in the context of a given task. For example, to answer the question "What color is the mug on the plate?" we must check the color of the specific mug that satisfies the "on" relationship with respect to the plate. Recent work has proposed various methods capable of complex relational reasoning. However, most of their power is in the inference structure, while the scene is represented with simple local appearance features. In this paper, we take an alternate approach and build contextualized representations for objects in a visual scene to support relational reasoning. We propose a general framework of Language-Conditioned Graph Networks (LCGN), where each node represents an object, and is described by a context-aware representation from related objects through iterative message passing conditioned on the textual input. E.g., conditioning on the "on" relationship to the plate, the object "mug" gathers messages from the object "plate" to update its representation to "mug on the plate", which can be easily consumed by a simple classifier for answer prediction. We experimentally show that our LCGN approach effectively supports relational reasoning and improves performance across several tasks and datasets.

READ FULL TEXT

page 1

page 4

page 8

page 12

page 13

page 14

research
04/05/2020

Iterative Context-Aware Graph Inference for Visual Dialog

Visual dialog is a challenging task that requires the comprehension of t...
research
02/15/2022

Hyper-relationship Learning Network for Scene Graph Generation

Generating informative scene graphs from images requires integrating and...
research
04/30/2020

Dynamic Language Binding in Relational Visual Reasoning

We present Language-binding Object Graph Network, the first neural reaso...
research
04/22/2020

Graph-based Kinship Reasoning Network

In this paper, we propose a graph-based kinship reasoning (GKR) network ...
research
06/06/2019

3D-RelNet: Joint Object and Relational Network for 3D Prediction

We propose an approach to predict the 3D shape and pose for the objects ...
research
06/18/2018

Modularity Matters: Learning Invariant Relational Reasoning Tasks

We focus on two supervised visual reasoning tasks whose labels encode a ...
research
12/06/2018

Local Conditioning: Exact Message Passing for Cyclic Undirected Distributed Networks

This paper addresses practical implementation of summing out, expanding,...

Please sign up or login with your details

Forgot password? Click here to reset