Document Context Language Models

11/12/2015
by   Yangfeng Ji, et al.
0

Text documents are structured on multiple levels of detail: individual words are related by syntax, but larger units of text are related by discourse structure. Existing language models generally fail to account for discourse structure, but it is crucial if we are to have language models that reward coherence and generate coherent texts. We present and empirically evaluate a set of multi-level recurrent neural network language models, called Document-Context Language Models (DCLM), which incorporate contextual information both within and beyond the sentence. In comparison with word-level recurrent neural network language models, the DCLM models obtain slightly better predictive likelihoods, and considerably better assessments of document coherence.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/13/2021

Discourse Probing of Pretrained Language Models

Existing work on probing of pretrained language models (LMs) has predomi...
research
01/12/2022

PhysNLU: A Language Resource for Evaluating Natural Language Understanding and Explanation Coherence in Physics

In order for language models to aid physics research, they must first en...
research
06/16/2023

Investigating the Utility of Surprisal from Large Language Models for Speech Synthesis Prosody

This paper investigates the use of word surprisal, a measure of the pred...
research
04/11/2016

Using Sentence-Level LSTM Language Models for Script Inference

There is a small but growing body of research on statistical scripts, mo...
research
05/07/2021

Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models

Coherent discourse is distinguished from a mere collection of utterances...
research
08/30/2019

Linguistic Versus Latent Relations for Modeling Coherent Flow in Paragraphs

Generating a long, coherent text such as a paragraph requires a high-lev...
research
08/16/2015

Online Representation Learning in Recurrent Neural Language Models

We investigate an extension of continuous online learning in recurrent n...

Please sign up or login with your details

Forgot password? Click here to reset