Cross-lingual and cross-domain discourse segmentation of entire documents

04/13/2017
by   Chloé Braud, et al.
0

Discourse segmentation is a crucial step in building end-to-end discourse parsers. However, discourse segmenters only exist for a few languages and domains. Typically they only detect intra-sentential segment boundaries, assuming gold standard sentence and token segmentation, and relying on high-quality syntactic parses and rich heuristics that are not generally available across languages and domains. In this paper, we propose statistical discourse segmenters for five languages and three domains that do not rely on gold pre-annotations. We also consider the problem of learning discourse segmenters when no labeled data is available for a language. Our fully supervised system obtains 89.5 performance on other domains, and we report supervised and unsupervised (cross-lingual) results for five languages in total.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/11/2017

Cross-lingual RST Discourse Parsing

Discourse parsing is an integral part of understanding information flow ...
research
09/16/2019

Bridging the domain gap in cross-lingual document classification

The scarcity of labeled training data often prohibits the internationali...
research
12/03/2020

Multilingual Neural RST Discourse Parsing

Text discourse parsing plays an important role in understanding informat...
research
04/14/2019

From News to Medical: Cross-domain Discourse Segmentation

The first step in discourse analysis involves dividing a text into segme...
research
02/10/2020

Automatic Discourse Segmentation: an evaluation in French

In this article, we describe some discursive segmentation methods as wel...
research
06/07/2021

X2Parser: Cross-Lingual and Cross-Domain Framework for Task-Oriented Compositional Semantic Parsing

Task-oriented compositional semantic parsing (TCSP) handles complex nest...
research
08/28/2018

Toward Fast and Accurate Neural Discourse Segmentation

Discourse segmentation, which segments texts into Elementary Discourse U...

Please sign up or login with your details

Forgot password? Click here to reset