Representation Degeneration Problem in Training Natural Language Generation Models

07/28/2019
by Jun Gao, et al.

We study a problem in training neural network-based models for natural language generation tasks, which we call the representation degeneration problem. We observe that when a model for natural language generation is trained through likelihood maximization with the weight tying trick, especially on large training datasets, most of the learned word embeddings tend to degenerate and become distributed in a narrow cone, which substantially limits the representation power of the word embeddings. We analyze the conditions and causes of this problem and propose a novel regularization method to address it. Experiments on language modeling and machine translation show that our method largely mitigates the representation degeneration problem and achieves better performance than baseline algorithms.
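
One way to make the "narrow cone" observation concrete is to measure the average cosine similarity between all pairs of embedding vectors: a value near 0 suggests directions spread across the space, while a value near 1 suggests the vectors crowd into a narrow cone. The sketch below is illustrative only and is not taken from the paper. The function names, the toy dimensions, and the synthetic "spread" and "cone" embeddings are all assumptions chosen for the demonstration.

```python
import math
import random
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def mean_pairwise_cosine(embeddings):
    """Average cosine similarity over all distinct pairs of rows."""
    pairs = list(combinations(embeddings, 2))
    return sum(cosine(u, v) for u, v in pairs) / len(pairs)


rng = random.Random(0)
dim, vocab = 16, 50  # toy sizes, assumed for illustration

# Hypothetical well-spread embeddings: isotropic Gaussian directions.
spread = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(vocab)]

# Hypothetical degenerate embeddings: a large shared offset plus small noise,
# so every vector points in roughly the same direction.
shared = [5.0] * dim
cone = [[s + rng.gauss(0.0, 1.0) for s in shared] for _ in range(vocab)]

spread_score = mean_pairwise_cosine(spread)
cone_score = mean_pairwise_cosine(cone)
print(f"spread: {spread_score:.3f}  cone: {cone_score:.3f}")
```

Under these assumptions, the synthetic cone embeddings score far higher than the spread ones. The same diagnostic could in principle be applied to a trained model's embedding matrix, and a penalty on this quantity is one plausible form a regularizer could take, although the abstract does not specify the method's details.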

research
09/18/2018

FRAGE: Frequency-Agnostic Word Representation

Continuous word representation (aka word embedding) is a basic building ...
research
09/07/2021

Rare Words Degenerate All Words

Despite advances in neural network language model, the representation de...
research
11/06/2017

Distributed Representation for Traditional Chinese Medicine Herb via Deep Learning Models

Traditional Chinese Medicine (TCM) has accumulated a big amount of preci...
research
04/04/2019

ReWE: Regressing Word Embeddings for Regularization of Neural Machine Translation Systems

Regularization of neural machine translation is still a significant prob...
research
04/17/2018

When and Why are Pre-trained Word Embeddings Useful for Neural Machine Translation?

The performance of Neural Machine Translation (NMT) systems often suffer...
research
12/19/2022

A Natural Bias for Language Generation Models

After just a few hundred training updates, a standard probabilistic mode...
research
08/27/2018

Predefined Sparseness in Recurrent Sequence Models

Inducing sparseness while training neural networks has been shown to yie...
