High Fidelity Vector Space Models of Structured Data

01/09/2019
by   Maxwell Crouse, et al.
6

Machine learning systems regularly deal with structured data in real-world applications. Unfortunately, such data has been difficult to faithfully represent in a way that most machine learning techniques would expect, i.e. as a real-valued vector of a fixed, pre-specified size. In this work, we introduce a novel approach that compiles structured data into a satisfiability problem which has in its set of solutions at least (and often only) the input data. The satisfiability problem is constructed from constraints which are generated automatically a priori from a given signature, thus trivially allowing for a bag-of-words-esque vector representation of the input to be constructed. The method is demonstrated in two areas, automated reasoning and natural language processing, where it is shown to be near-perfect in producing vector representations of natural-language sentences and first-order logic clauses that can be translated back to their original, structured input forms.

READ FULL TEXT
research
05/12/2017

Evaluating vector-space models of analogy

Vector-space representations provide geometric tools for reasoning about...
research
04/02/2019

Neural Vector Conceptualization for Word Vector Space Interpretation

Distributed word vector spaces are considered hard to interpret which hi...
research
02/10/2023

Translating Natural Language to Planning Goals with Large-Language Models

Recent large language models (LLMs) have demonstrated remarkable perform...
research
02/21/2017

Neural Multi-Step Reasoning for Question Answering on Semi-Structured Tables

Advances in natural language processing tasks have gained momentum in re...
research
01/24/2022

Cobol2Vec: Learning Representations of Cobol code

There has been a steadily growing interest in development of novel metho...
research
06/23/2021

False perfection in machine prediction: Detecting and assessing circularity problems in machine learning

Machine learning algorithms train models from patterns of input data and...
research
10/12/2018

Embedding Geographic Locations for Modelling the Natural Environment using Flickr Tags and Structured Data

Meta-data from photo-sharing websites such as Flickr can be used to obta...

Please sign up or login with your details

Forgot password? Click here to reset