Error-Correcting Neural Sequence Prediction

01/21/2019
by   James O'Neill, et al.
2

In this paper we propose a novel neural language modelling (NLM) method based on error-correcting output codes (ECOC), abbreviated as ECOC-NLM. This latent variable based approach provides a principled way to choose a varying amount of latent output codes and avoids exact softmax normalization. Instead of minimizing measures between the predicted probability distribution and true distribution, we use error-correcting codes to represent both predictions and outputs. Secondly, we propose multiple ways to improve accuracy and convergence rates by maximizing the separability between codes that correspond to classes proportional to word embedding similarities. Lastly, we introduce a novel method called Latent Mixture Sampling, a technique that is used to mitigate exposure bias and can be integrated into training latent-based neural language models. This involves mixing the latent codes (i.e variables) of past predictions and past targets in one of two ways: (1) according to a predefined sampling schedule or (2) a differentiable sampling procedure whereby the mixing probability is learned throughout training by replacing the greedy argmax operation with a smooth approximation. In evaluating Codeword Mixture Sampling for ECOC-NLM, we also baseline it against CWMS in a closely related Hierarhical Softmax-based NLM.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/16/2018

Error correcting codes from sub-exceeding fonction

In this paper, we present error-correcting codes which are the results o...
research
04/23/2017

Differentiable Scheduled Sampling for Credit Assignment

We demonstrate that a continuous relaxation of the argmax operation can ...
research
04/23/2017

Neural Machine Translation via Binary Code Prediction

In this paper, we propose a new method for calculating the output layer ...
research
09/16/2018

Curriculum-Based Neighborhood Sampling For Sequence Prediction

The task of multi-step ahead prediction in language models is challengin...
research
11/27/2018

Maximizing Multivariate Information with Error-Correcting Codes

Multivariate mutual information provides a conceptual framework for char...
research
05/10/2023

Discovery of Optimal Quantum Error Correcting Codes via Reinforcement Learning

The recently introduced Quantum Lego framework provides a powerful metho...

Please sign up or login with your details

Forgot password? Click here to reset