Adversarial Generation and Encoding of Nested Texts

06/01/2019
by Alon Rozental, et al.

In this paper we propose a new language model called AGENT, which stands for Adversarial Generation and Encoding of Nested Texts. AGENT is designed for encoding, generating and refining documents that consist of a long and coherent text, such as an entire book, provided they are hierarchically annotated (nested), i.e., divided into sentences, paragraphs and chapters. The core idea of our system is learning a vector representation for each level of the text hierarchy (sentences, paragraphs, etc.) and training each such representation to perform three tasks: reconstructing the sequence of lower-level vectors that was used to create the representation, and generalized versions of the Masked Language Modeling (MLM) and Next Sentence Prediction tasks from BERT (Devlin et al. [2018]). Additionally, we present a new adversarial model for long text generation and suggest a way to improve the coherence of the generated text by traversing its vector representation tree.
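To make the idea of a vector representation tree concrete, here is a minimal sketch of bottom-up encoding over a nested text. It is not the AGENT architecture: the names `Node`, `embed_token`, `mean_pool` and `encode`, the toy dimension `DIM`, and the use of mean pooling in place of a learned encoder are all assumptions made for illustration. It only shows how each level (sentence, paragraph, chapter, book) can get one vector built from the vectors of the level below.

```python
from dataclasses import dataclass, field
from typing import List, Union

DIM = 4  # toy vector size (assumption, chosen only for readability)

# A nested text: a token is a str; every higher unit is a non-empty list of lower units.
Nested = Union[str, List["Nested"]]


@dataclass
class Node:
    """One node of the vector representation tree."""
    vector: List[float]
    children: List["Node"] = field(default_factory=list)


def embed_token(token: str) -> List[float]:
    # Deterministic stand-in for a learned token embedding (hypothetical).
    code = sum(ord(ch) for ch in token)
    return [((code * (i + 1)) % 97) / 97.0 for i in range(DIM)]


def mean_pool(vectors: List[List[float]]) -> List[float]:
    # Placeholder for a learned encoder: average the child vectors.
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(DIM)]


def encode(unit: Nested) -> Node:
    """Encode a nested unit bottom-up, keeping the child nodes for traversal."""
    if isinstance(unit, str):
        return Node(embed_token(unit))
    children = [encode(child) for child in unit]
    return Node(mean_pool([c.vector for c in children]), children)


def depth(node: Node) -> int:
    """Number of hierarchy levels below this node (a token has depth 0)."""
    return 0 if not node.children else 1 + max(depth(c) for c in node.children)


# book -> chapters -> paragraphs -> sentences -> tokens
book = [
    [[["the", "cat", "sat"], ["it", "slept"]]],
    [[["a", "new", "day"]], [["the", "end"]]],
]
root = encode(book)
```

In this sketch `root.vector` stands for the whole book and `root.children` holds one node per chapter, so the tree can be traversed level by level. A trained model would replace `mean_pool` with a learned encoder and add the reconstruction, masking and next-unit objectives described above.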


research
05/19/2021

Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence

Generating long and coherent text is an important but challenging task, ...
research
03/21/2022

Language modeling via stochastic processes

Modern language models can generate high-quality short texts. However, t...
research
08/14/2018

Top-Down Tree Structured Text Generation

Text generation is a fundamental building block in natural language proc...
research
09/07/2020

Improving Language Generation with Sentence Coherence Objective

Conditional story generation and contextual text continuation have becom...
research
09/24/2017

Long Text Generation via Adversarial Training with Leaked Information

Automatically generating coherent and semantically meaningful text has m...
research
11/01/2018

A bird's-eye view on coherence, and a worm's-eye view on cohesion

Generating coherent and cohesive long-form texts is a challenging proble...
research
05/03/2023

Towards Imperceptible Document Manipulations against Neural Ranking Models

Adversarial attacks have gained traction in order to identify potential ...
