Minimum Risk Training for Neural Machine Translation

12/08/2015
by   Shiqi Shen, et al.
0

We propose minimum risk training for end-to-end neural machine translation. Unlike conventional maximum likelihood estimation, minimum risk training is capable of optimizing model parameters directly with respect to arbitrary evaluation metrics, which are not necessarily differentiable. Experiments show that our approach achieves significant improvements over maximum likelihood estimation on a state-of-the-art neural machine translation system across various languages pairs. Transparent to architectures, our approach can be applied to more neural networks and potentially benefit more NLP tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/20/2016

Lexicons and Minimum Risk Training for Neural Machine Translation: NAIST-CMU at WAT2016

This year, the Nara Institute of Science and Technology (NAIST)/Carnegie...
research
06/04/2020

MLE-guided parameter search for task loss minimization in neural sequence modeling

Neural autoregressive sequence models are used to generate sequences in ...
research
04/07/2016

Neural Headline Generation with Sentence-wise Optimization

Recently, neural models have been proposed for headline generation by le...
research
06/20/2017

THUMT: An Open Source Toolkit for Neural Machine Translation

This paper introduces THUMT, an open-source toolkit for neural machine t...
research
09/14/2019

Beyond BLEU: Training Neural Machine Translation with Semantic Similarity

While most neural machine translation (NMT) systems are still trained us...
research
07/17/2023

Enhancing Supervised Learning with Contrastive Markings in Neural Machine Translation Training

Supervised learning in Neural Machine Translation (NMT) typically follow...
research
06/06/2021

Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation

The shortcomings of maximum likelihood estimation in the context of mode...

Please sign up or login with your details

Forgot password? Click here to reset