Encoding Spatial Relations from Natural Language

07/04/2018
by   Tiago Ramalho, et al.
2

Natural language processing has made significant inroads into learning the semantics of words through distributional approaches, however representations learnt via these methods fail to capture certain kinds of information implicit in the real world. In particular, spatial relations are encoded in a way that is inconsistent with human spatial reasoning and lacking invariance to viewpoint changes. We present a system capable of capturing the semantics of spatial relations such as behind, left of, etc from natural language. Our key contributions are a novel multi-modal objective based on generating images of scenes from their textual descriptions, and a new dataset on which to train it. We demonstrate that internal representations are robust to meaning preserving transformations of descriptions (paraphrase invariance), while viewpoint invariance is an emergent property of the system.

READ FULL TEXT

page 4

page 8

page 13

page 14

page 15

page 16

research
11/28/2021

Natural Language and Spatial Rules

We develop a system that formally represents spatial semantics concepts ...
research
07/20/2016

Robust Natural Language Processing - Combining Reasoning, Cognitive Semantics and Construction Grammar for Spatial Language

We present a system for generating and understanding of dynamic and stat...
research
07/19/2020

From Spatial Relations to Spatial Configurations

Spatial Reasoning from language is essential for natural language unders...
research
08/28/2019

SpatialNLI: A Spatial Domain Natural Language Interface to Databases Using Spatial Comprehension

A natural language interface (NLI) to databases is an interface that tra...
research
03/22/2018

Text2Shape: Generating Shapes from Natural Language by Learning Joint Embeddings

We present a method for generating colored 3D shapes from natural langua...
research
10/02/2019

Embodied Language Grounding with Implicit 3D Visual Feature Representations

Consider the utterance "the tomato is to the left of the pot." Humans ca...
research
12/10/2017

Learning Interpretable Spatial Operations in a Rich 3D Blocks World

In this paper, we study the problem of mapping natural language instruct...

Please sign up or login with your details

Forgot password? Click here to reset