DiscSense: Automated Semantic Analysis of Discourse Markers

06/02/2020
by   Damien Sileo, et al.

Discourse markers (by contrast, happily, etc.) are words or phrases used to signal semantic and/or pragmatic relationships between clauses or sentences. Recent work has fruitfully explored the prediction of discourse markers between sentence pairs in order to learn accurate sentence representations that are useful in various classification tasks. In this work, we take another perspective: using a model trained to predict discourse markers between sentence pairs, we predict plausible markers between sentence pairs with a known semantic relation (provided by existing classification datasets). These predictions allow us to study the link between discourse markers and the semantic relations annotated in classification datasets. Handcrafted mappings between markers and discourse relations have been proposed, but only for a limited set of markers and a limited set of categories. Yet there exist hundreds of discourse markers expressing a wide variety of relations, and competing discourse theories (which are largely built in a top-down fashion) have reached no consensus on the taxonomy of relations. By using an automatic prediction method over existing semantically annotated datasets, we provide a bottom-up characterization of discourse markers in English. The resulting dataset, named DiscSense, is publicly available.
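To make the bottom-up idea concrete, the following is a minimal sketch of how predicted markers could be tied to annotated classes, assuming the marker prediction model has already produced one marker per sentence pair. The function name `marker_label_associations`, the `min_support` threshold, and the toy pairs are all hypothetical and invented for illustration; the abstract does not specify which association statistic the authors actually use, so relative frequency is only a stand-in.

```python
from collections import Counter, defaultdict


def marker_label_associations(pairs, min_support=2):
    """Estimate P(label | predicted marker) from (marker, label) pairs.

    `pairs` is an iterable of (predicted_marker, dataset_label) tuples.
    Markers predicted fewer than `min_support` times are dropped, since
    a relative frequency over one or two examples is not informative.
    """
    marker_counts = Counter()
    joint_counts = Counter()
    for marker, label in pairs:
        marker_counts[marker] += 1
        joint_counts[(marker, label)] += 1

    table = defaultdict(dict)
    for (marker, label), n in joint_counts.items():
        if marker_counts[marker] >= min_support:
            table[marker][label] = n / marker_counts[marker]
    return dict(table)


# Toy, made-up predictions (NOT taken from DiscSense): each tuple pairs a
# marker predicted between two sentences with the label that an existing
# classification dataset assigns to that sentence pair.
pairs = [
    ("by contrast", "contradiction"),
    ("by contrast", "contradiction"),
    ("by contrast", "neutral"),
    ("happily", "positive"),
    ("happily", "positive"),
    ("sadly", "negative"),
]

assoc = marker_label_associations(pairs)
for marker, labels in sorted(assoc.items()):
    print(marker, labels)
```

In this sketch, a marker whose predictions concentrate on one label would be read as expressing that relation, while rarely predicted markers are filtered out before any conclusion is drawn.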


