A bird's-eye view on coherence, and a worm's-eye view on cohesion

11/01/2018
by Woon Sang Cho, et al.

Generating coherent and cohesive long-form text is a challenging problem in natural language generation. Previous work relied on large amounts of human-generated text to train language models, but few attempts were made to explicitly model the desired linguistic properties of natural language text, such as coherence and cohesion. In this work, we train two expert discriminators, one for coherence and one for cohesion, to provide hierarchical feedback for text generation. We also propose a simple variant of policy gradient, called 'negative-critical sequence training', which uses margin rewards and constructs the 'baseline' from randomly generated negative samples. We demonstrate the effectiveness of our approach through empirical studies, showing significant improvements over a strong baseline, an attention-based bidirectional MLE-trained neural language model, on a number of automated metrics. The proposed discriminators can serve as baseline architectures to promote further research on better extracting and encoding essential linguistic qualities, such as coherence and cohesion.
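The margin-reward idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the averaging of negative-sample scores into a baseline, and the toy word-overlap scorer standing in for a trained discriminator are all assumptions made for illustration. The abstract states only that the baseline is built from randomly generated negative samples.

```python
from typing import Callable, Sequence

ScoreFn = Callable[[str, str], float]


def margin_reward(score_fn: ScoreFn, context: str, generated: str,
                  negatives: Sequence[str]) -> float:
    """Score of the generated text minus a baseline from negative samples.

    Assumption: the baseline is the mean discriminator score of the negatives.
    """
    if not negatives:
        raise ValueError("need at least one negative sample")
    baseline = sum(score_fn(context, n) for n in negatives) / len(negatives)
    return score_fn(context, generated) - baseline


def policy_gradient_loss(token_log_probs: Sequence[float], reward: float) -> float:
    """REINFORCE-style surrogate loss: -reward * sum of token log-probabilities."""
    return -reward * sum(token_log_probs)


def toy_score(context: str, text: str) -> float:
    """Hypothetical stand-in for a discriminator: fraction of words shared with the context."""
    ctx = set(context.lower().split())
    words = text.lower().split()
    return sum(w in ctx for w in words) / len(words) if words else 0.0


if __name__ == "__main__":
    context = "the hotel room was clean"
    generated = "the room was clean and quiet"
    negatives = ["my flight left late", "the pizza was cold"]  # random unrelated texts
    reward = margin_reward(toy_score, context, generated, negatives)
    loss = policy_gradient_loss([-0.5, -1.0, -0.5], reward)
    print(f"margin reward = {reward:.3f}, surrogate loss = {loss:.3f}")
```

In this sketch a positive margin means the generated text scored above the negatives, so minimizing the surrogate loss raises the probability of the sampled tokens; a negative margin pushes it down.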

research
09/22/2022

Learning to Write with Coherence From Negative Examples

Coherence is one of the critical factors that determine the quality of w...
research
10/20/2018

Hierarchical Text Generation using an Outline

Many challenges in natural language processing require generating text, ...
research
10/14/2021

Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling

Although large-scale pre-trained neural models have shown impressive per...
research
10/31/2018

Extracting Linguistic Resources from the Web for Concept-to-Text Generation

Many concept-to-text generation systems require domain-specific linguist...
research
09/15/2023

Self-Consistent Narrative Prompts on Abductive Natural Language Inference

Abduction has long been seen as crucial for narrative comprehension and ...
research
06/01/2019

Adversarial Generation and Encoding of Nested Texts

In this paper we propose a new language model called AGENT, which stands...
research
10/15/2021

Boosting coherence of language models

Naturality of long-term information structure – coherence – remains a ch...
