CohEval: Benchmarking Coherence Models

04/30/2020
by   Tasnim Mohiuddin, et al.
0

Although coherence modeling has come a long way in developing novel models, their evaluation on downstream applications has largely been neglected. With the advancements made by neural approaches in applications such as machine translation, text summarization and dialogue systems, the need for standard coherence evaluation is now more crucial than ever. In this paper, we propose to benchmark coherence models on a number of synthetic and downstream tasks. In particular, we evaluate well-known traditional and neural coherence models on sentence ordering tasks, and also on three downstream applications including coherence evaluation for machine translation, summarization and next utterance prediction. We also show model produced rankings for pre-trained language model outputs as another use-case. Our results demonstrate a weak correlation between the model performances in the synthetic tasks and the downstream applications, motivating alternate evaluation methods for coherence models. This work has led us to create a leaderboard to foster further research in coherence modeling.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/01/2019

A Unified Neural Coherence Model

Recently, neural approaches to coherence modeling have achieved state-of...
research
12/06/2018

Context is Key: New Approaches to Neural Coherence Modeling

We formulate coherence modeling as a regression task and propose two nov...
research
11/27/2020

FFCI: A Framework for Interpretable Automatic Evaluation of Summarization

In this paper, we propose FFCI, a framework for automatic summarization ...
research
10/14/2021

Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling

Although large-scale pre-trained neural models have shown impressive per...
research
09/05/2021

Transformer Models for Text Coherence Assessment

Coherence is an important aspect of text quality and is crucial for ensu...
research
03/22/2019

Pre-trained Language Model Representations for Language Generation

Pre-trained language model representations have been successful in a wid...
research
06/05/2020

Evaluating Text Coherence at Sentence and Paragraph Levels

In this paper, to evaluate text coherence, we propose the paragraph orde...

Please sign up or login with your details

Forgot password? Click here to reset