Automatic Document Sketching: Generating Drafts from Analogous Texts

06/14/2021
by Zeqiu Wu, et al.

The advent of large pre-trained language models has enabled high-quality predictions of how to add or change a sentence in a document. However, the high branching factor inherent to text generation impedes the ability of even the strongest language models to offer useful editing suggestions at a more global or document level. We introduce a new task, document sketching, which involves generating entire draft documents for the writer to review and revise. These drafts are built from sets of documents that overlap in form, sharing large segments of potentially reusable text, while diverging in content. To support this task, we introduce a Wikipedia-based dataset of analogous documents and investigate the application of weakly supervised methods, including the use of a transformer-based mixture of experts, together with reinforcement learning. We report experiments using automated and human evaluation methods and discuss the relative merits of these models.


