Dictionary-based Phrase-level Prompting of Large Language Models for Machine Translation

02/15/2023
by   Marjan Ghazvininejad, et al.
0

Large language models (LLMs) demonstrate remarkable machine translation (MT) abilities via prompting, even though they were not explicitly trained for this task. However, even given the incredible quantities of data they are trained on, LLMs can struggle to translate inputs with rare words, which are common in low resource or domain transfer scenarios. We show that LLM prompting can provide an effective solution for rare words as well, by using prior knowledge from bilingual dictionaries to provide control hints in the prompts. We propose a novel method, DiPMT, that provides a set of possible translations for a subset of the input words, thereby enabling fine-grained phrase-level prompted control of the LLM. Extensive experiments show that DiPMT outperforms the baseline both in low-resource MT, as well as for out-of-domain MT. We further provide a qualitative analysis of the benefits and limitations of this approach, including the overall level of controllability that is achieved.

READ FULL TEXT

page 7

page 8

research
05/11/2023

Chain-of-Dictionary Prompting Elicits Translation in Large Language Models

Large language models (LLMs) have shown surprisingly good performance in...
research
01/26/2018

Context Models for OOV Word Translation in Low-Resource Languages

Out-of-vocabulary word translation is a major problem for the translatio...
research
01/18/2015

Phrase Based Language Model For Statistical Machine Translation

We consider phrase based Language Models (LM), which generalize the comm...
research
02/21/2019

Development of a classifiers/quantifiers dictionary towards French-Japanese MT

Although classifiers/quantifiers (CQs) expressions appear frequently in ...
research
08/05/2022

Phrase translation using a bilingual dictionary and n-gram data: A case study from Vietnamese to English

Past approaches to translate a phrase in a language L1 to a language L2 ...
research
04/16/2018

Approaching Neural Grammatical Error Correction as a Low-Resource Machine Translation Task

Previously, neural methods in grammatical error correction (GEC) did not...
research
07/03/2021

Can Transformers Jump Around Right in Natural Language? Assessing Performance Transfer from SCAN

Despite their practical success, modern seq2seq architectures are unable...

Please sign up or login with your details

Forgot password? Click here to reset