Conciseness: An Overlooked Language Task

11/08/2022
by Felix Stahlberg, et al.

We report on novel investigations into training models that make sentences concise. We define the task and show that it is different from related tasks such as summarization and simplification. For evaluation, we release two test sets, consisting of 2000 sentences each, that were annotated by two and five human annotators, respectively. We demonstrate that conciseness is a difficult task for which zero-shot setups with large neural language models often do not perform well. Given the limitations of these approaches, we propose a synthetic data generation method based on round-trip translations. Using this data either to train Transformers from scratch or to fine-tune T5 models yields our strongest baselines, which can be further improved by fine-tuning on an artificial conciseness dataset that we derived from multi-annotator machine translation test sets.
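
To make the round-trip idea concrete, the following is a minimal sketch, not the authors' pipeline. It assumes that a sentence is translated into a pivot language and back, and that a pair is kept only when the round-trip output has fewer words than the input. That length filter, the function name `round_trip_pairs`, and the two translator callables are hypothetical and introduced here for illustration only; the abstract does not specify how the synthetic pairs are selected.

```python
from typing import Callable, Iterable, List, Tuple

# A translator is any function mapping a sentence to a sentence.
# In practice this would wrap a machine translation system (assumption).
Translator = Callable[[str], str]


def round_trip_pairs(
    sentences: Iterable[str],
    to_pivot: Translator,
    from_pivot: Translator,
) -> List[Tuple[str, str]]:
    """Build (verbose, concise) training pairs via round-trip translation.

    Assumed filter (not taken from the paper): keep a pair only if the
    round-trip output is non-empty and has fewer whitespace-separated
    tokens than the original sentence.
    """
    pairs: List[Tuple[str, str]] = []
    for source in sentences:
        # Translate into the pivot language, then back again.
        round_trip = from_pivot(to_pivot(source))
        source_len = len(source.split())
        round_trip_len = len(round_trip.split())
        if 0 < round_trip_len < source_len:
            pairs.append((source, round_trip))
    return pairs
```

The resulting pairs could then serve as source/target examples for a sequence-to-sequence model. A real setup would likely need additional checks, for example that the shorter output preserves the meaning of the input.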


