Better Datastore, Better Translation: Generating Datastores from Pre-Trained Models for Nearest Neural Machine Translation

12/17/2022
by   Jiahuan Li, et al.

Nearest Neighbor Machine Translation (kNN-MT) is a simple and effective method of augmenting neural machine translation (NMT) with a token-level nearest neighbor retrieval mechanism. The effectiveness of kNN-MT depends directly on the quality of the retrieved neighbors. However, the original kNN-MT builds its datastores from representations produced by the NMT model itself, so retrieval accuracy is poor when the NMT model is weak, which leads to sub-optimal translation performance. In this paper, we propose PRED, a framework that leverages Pre-trained models for Datastores in kNN-MT. Better representations from pre-trained models allow us to build datastores of better quality. We also design a novel contrastive alignment objective to mitigate the representation gap between the NMT model and the pre-trained models, enabling the NMT model to retrieve from the better datastores. We conduct extensive experiments on both bilingual and multilingual translation benchmarks, including WMT17 English ↔ Chinese, WMT14 English ↔ German, IWSLT14 German ↔ English, and IWSLT14 multilingual datasets. Empirical results demonstrate the effectiveness of PRED.
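For readers unfamiliar with the token-level retrieval mechanism the abstract refers to, the following is a minimal sketch of the commonly used kNN-MT formulation, not of PRED itself: a query vector retrieves its k nearest datastore entries, their distances are turned into a distribution over target tokens, and that distribution is interpolated with the NMT model's own prediction. The toy datastore, the two-dimensional vectors, the German tokens, and the function names `knn_distribution` and `interpolate` are all illustrative assumptions, as are the values of `k`, `temperature`, and `lam`.

```python
import math
from collections import defaultdict


def knn_distribution(query, datastore, k=2, temperature=1.0):
    """Turn the k nearest datastore entries into a {token: prob} distribution.

    datastore is a list of (key_vector, target_token) pairs (toy stand-in
    for the real datastore of decoder representations).
    """
    # Squared L2 distance from the query to every key, smallest first.
    scored = sorted(
        (sum((q - x) ** 2 for q, x in zip(query, key)), token)
        for key, token in datastore
    )[:k]
    # Softmax over negative distances: closer neighbors get more weight.
    weights = [math.exp(-dist / temperature) for dist, _ in scored]
    total = sum(weights)
    probs = defaultdict(float)
    for (_, token), weight in zip(scored, weights):
        probs[token] += weight / total  # neighbors with the same token add up
    return dict(probs)


def interpolate(p_nmt, p_knn, lam=0.5):
    """Mix the retrieval and NMT distributions with weight lam on retrieval."""
    vocab = set(p_nmt) | set(p_knn)
    return {
        tok: lam * p_knn.get(tok, 0.0) + (1 - lam) * p_nmt.get(tok, 0.0)
        for tok in vocab
    }


# Hypothetical toy data: keys are 2-d vectors, values are target tokens.
datastore = [
    ((0.0, 0.0), "Haus"),
    ((0.1, 0.0), "Haus"),
    ((1.0, 1.0), "Heim"),
    ((5.0, 5.0), "Hund"),
]
p_nmt = {"Haus": 0.4, "Heim": 0.5, "Hund": 0.1}  # assumed NMT prediction
p_knn = knn_distribution((0.05, 0.0), datastore, k=2)
p_final = interpolate(p_nmt, p_knn, lam=0.5)
```

In this toy setup the NMT distribution alone prefers "Heim", but both retrieved neighbors carry "Haus", so the interpolated distribution prefers "Haus". The sketch also shows why datastore quality matters: if the keys place unrelated tokens near the query, the retrieved distribution pulls the final prediction in the wrong direction.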

research
02/23/2023

Simple and Scalable Nearest Neighbor Machine Translation

kNN-MT is a straightforward yet powerful approach for fast domain adapta...
research
08/29/2023

CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation

There has been a growing interest in developing multimodal machine trans...
research
01/18/2023

Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection

Neural sequence generation models are known to "hallucinate", by produci...
research
04/16/2023

A Comprehensive Evaluation of the Copy Mechanism for Natural Language to SPARQL Query Generation

In recent years, the field of neural machine translation (NMT) for SPARQ...
research
02/15/2021

Meta Back-translation

Back-translation is an effective strategy to improve the performance of ...
research
04/07/2021

Better Neural Machine Translation by Extracting Linguistic Information from BERT

Adding linguistic information (syntax or semantics) to neural machine tr...
research
10/23/2022

Translation Word-Level Auto-Completion: What can we achieve out of the box?

Research on Machine Translation (MT) has achieved important breakthrough...
