Leveraging Neural Machine Translation for Word Alignment

03/31/2021
by   Vilém Zouhar, et al.
0

The most common tools for word-alignment rely on a large amount of parallel sentences, which are then usually processed according to one of the IBM model algorithms. The training data is, however, the same as for machine translation (MT) systems, especially for neural MT (NMT), which itself is able to produce word-alignments using the trained attention heads. This is convenient because word-alignment is theoretically a viable byproduct of any attention-based NMT, which is also able to provide decoder scores for a translated sentence pair. We summarize different approaches on how word-alignment can be extracted from alignment scores and then explore ways in which scores can be extracted from NMT, focusing on inferring the word-alignment scores based on output sentence and token probabilities. We compare this to the extraction of alignment scores from attention. We conclude with aggregating all of the sources of alignment scores into a simple feed-forward network which achieves the best results when combined alignment extractors are used.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/14/2016

Neural Machine Translation with Supervised Attention

The attention mechanisim is appealing for neural machine translation, si...
research
01/14/2016

Smoothing parameter estimation framework for IBM word alignment models

IBM models are very important word alignment models in Machine Translati...
research
04/30/2020

End-to-End Neural Word Alignment Outperforms GIZA++

Word alignment was once a core unsupervised learning task in natural lan...
research
10/17/2017

Paying Attention to Multi-Word Expressions in Neural Machine Translation

Processing of multi-word expressions (MWEs) is a known problem for any n...
research
11/30/2022

Word Alignment in the Era of Deep Learning: A Tutorial

The word alignment task, despite its prominence in the era of statistica...
research
09/04/2019

Jointly Learning to Align and Translate with Transformer Models

The state of the art in machine translation (MT) is governed by neural a...
research
06/25/2019

Saliency-driven Word Alignment Interpretation for Neural Machine Translation

Despite their original goal to jointly learn to align and translate, Neu...

Please sign up or login with your details

Forgot password? Click here to reset