DisSent: Sentence Representation Learning from Explicit Discourse Relations

10/12/2017
by   Allen Nie, et al.
0

Sentence vectors represent an appealing approach to meaning: learn an embedding that encompasses the meaning of a sentence in a single vector, that can be used for a variety of semantic tasks. Existing models for learning sentence embeddings either require extensive computational resources to train on large corpora, or are trained on costly, manually curated datasets of sentence relations. We observe that humans naturally annotate the relations between their sentences with discourse markers like "but" and "because". These words are deeply linked to the meanings of the sentences they connect. Using this natural signal, we automatically collect a classification dataset from unannotated text. Training a model to predict these discourse markers yields high quality sentence embeddings. Our model captures complementary information to existing models and achieves comparable generalization performance to state of the art models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/02/2020

DiscSense: Automated Semantic Analysis of Discourse Markers

Discourse markers ( by contrast, happily, etc.) are words or phrases th...
research
03/28/2019

Mining Discourse Markers for Unsupervised Sentence Representation Learning

Current state of the art systems in NLP heavily rely on manually annotat...
research
10/25/2016

Dis-S2V: Discourse Informed Sen2Vec

Vector representation of sentences is important for many text processing...
research
09/09/2018

Can Neural Generators for Dialogue Learn Sentence Planning and Discourse Structuring?

Responses in task-oriented dialogue systems often realize multiple propo...
research
09/05/2019

TransSent: Towards Generation of Structured Sentences with Discourse Marker

This paper focuses on the task of generating long structured sentences w...
research
10/06/2020

Semantically Driven Sentence Fusion: Modeling and Evaluation

Sentence fusion is the task of joining related sentences into coherent t...
research
04/23/2017

Discourse-Based Objectives for Fast Unsupervised Sentence Representation Learning

This work presents a novel objective function for the unsupervised train...

Please sign up or login with your details

Forgot password? Click here to reset