Assessing Composition in Sentence Vector Representations

09/11/2018
by Allyson Ettinger, et al.

An important component of achieving language understanding is mastering the composition of sentence meaning, but an immediate challenge to solving this problem is the opacity of sentence vector representations produced by current neural sentence composition models. We present a method to address this challenge, developing tasks that directly target compositional meaning information in sentence vector representations with a high degree of precision and control. To enable the creation of these controlled tasks, we introduce a specialized sentence generation system that produces large, annotated sentence sets meeting specified syntactic, semantic and lexical constraints. We describe the details of the method and generation system, and then present results of experiments applying our method to probe for compositional information in embeddings from a number of existing sentence composition models. We find that the method is able to extract useful information about the differing capacities of these models, and we discuss the implications of our results for how well these systems capture sentence information. We make available for public use the datasets used for these experiments, as well as the generation system.

research
08/10/2015

Syntax-Aware Multi-Sense Word Embeddings for Deep Compositional Models of Meaning

Deep compositional models of meaning acting on distributional representa...
research
01/21/2023

Syntax-guided Neural Module Distillation to Probe Compositionality in Sentence Embeddings

Past work probing compositionality in sentence embedding models faces is...
research
03/21/2022

Quality Controlled Paraphrase Generation

Paraphrase generation has been widely used in various downstream tasks. ...
research
06/14/2021

Improving Paraphrase Detection with the Adversarial Paraphrasing Task

If two sentences have the same meaning, it should follow that they are e...
research
10/24/2021

Distributed neural encoding of binding to thematic roles

A framework and method are proposed for the study of constituent composi...
research
11/02/2022

Unsupervised Syntactically Controlled Paraphrase Generation with Abstract Meaning Representations

Syntactically controlled paraphrase generation has become an emerging re...
research
04/13/2021

Should Semantic Vector Composition be Explicit? Can it be Linear?

Vector representations have become a central element in semantic languag...
