Enabling Language Models to Fill in the Blanks

05/11/2020
by   Chris Donahue, et al.
0

We present a simple approach for text infilling, the task of predicting missing spans of text at any position in a document. While infilling could enable rich functionality especially for writing assistance tools, more attention has been devoted to language modeling—a special case of infilling where text is predicted at the end of a document. In this paper, we aim to extend the capabilities of language models (LMs) to the more general task of infilling. To this end, we train (or fine-tune) off-the-shelf LMs on sequences containing the concatenation of artificially-masked text and the text which was masked. We show that this approach, which we call infilling by language modeling, can enable LMs to infill entire sentences effectively on three different domains: short stories, scientific abstracts, and lyrics. Furthermore, we show that humans have difficulty identifying sentences infilled by our approach as machine-generated in the domain of short stories.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/07/2021

Understanding by Understanding Not: Modeling Negation in Language Models

Negation is a core construction in natural language. Despite being very ...
research
03/21/2022

Language modeling via stochastic processes

Modern language models can generate high-quality short texts. However, t...
research
03/15/2022

Do Language Models Plagiarize?

Past literature has illustrated that language models do not fully unders...
research
04/14/2021

IGA : An Intent-Guided Authoring Assistant

While large-scale pretrained language models have significantly improved...
research
06/22/2020

Clinical Predictive Keyboard using Statistical and Neural Language Modeling

A language model can be used to predict the next word during authoring, ...
research
04/18/2021

Go Forth and Prosper: Language Modeling with Ancient Textual History

We introduce a technique for improving document-level language models (L...
research
05/12/2022

A Generalist Agent

Inspired by progress in large-scale language modeling, we apply a simila...

Please sign up or login with your details

Forgot password? Click here to reset