Relating Neural Text Degeneration to Exposure Bias

09/17/2021
by   Ting-Rui Chiang, et al.

This work relates two mysteries in neural text generation: exposure bias and text degeneration. Although exposure bias was identified long ago and numerous remedies have been studied, to our knowledge its impact on text generation has not yet been verified. Text degeneration is a problem that the widely used pre-trained language model GPT-2 was recently found to suffer from (Holtzman et al., 2020). Motivated by the unknown cause of text degeneration, in this paper we attempt to relate these two mysteries. Specifically, we first identify, both qualitatively and quantitatively, the mistakes made before text degeneration occurs. We then investigate the significance of these mistakes by inspecting the hidden states in GPT-2. Our results show that text degeneration is likely to be partly caused by exposure bias. We also study the self-reinforcing mechanism of text degeneration, explaining why the mistakes amplify. In sum, our study provides a more concrete foundation for further investigation of the exposure bias and text degeneration problems.


