Towards Effective Disambiguation for Machine Translation with Large Language Models

09/20/2023
by Vivek Iyer, et al.

Resolving semantic ambiguity has long been recognised as a central challenge in machine translation. Recent work on benchmarking translation performance on ambiguous sentences has exposed the limitations of conventional Neural Machine Translation (NMT) systems, which fail to capture many of these cases. Large language models (LLMs) have emerged as a promising alternative, performing comparably to traditional NMT models while introducing new paradigms for controlling the target outputs. In this paper, we study the capabilities of LLMs to translate ambiguous sentences containing polysemous words and rare word senses. We also propose two ways to improve the handling of such ambiguity: in-context learning and fine-tuning on carefully curated ambiguous datasets. Experiments show that our methods can match or outperform state-of-the-art systems such as DeepL and NLLB in four out of five language directions. Our research provides valuable insights into effectively adapting LLMs for disambiguation during machine translation.

research
08/22/2017

Handling Homographs in Neural Machine Translation

Homographs, words with different meanings but the same surface form, hav...
research
05/23/2023

Empowering LLM-based Machine Translation with Cultural Awareness

Traditional neural machine translation (NMT) systems often fail to trans...
research
09/06/2023

Gender-specific Machine Translation with Large Language Models

Decoder-only Large Language Models (LLMs) have demonstrated potential in...
research
08/26/2019

Transductive Data-Selection Algorithms for Fine-Tuning Neural Machine Translation

Machine Translation models are trained to translate a variety of documen...
research
09/26/2021

Curb Your Carbon Emissions: Benchmarking Carbon Emissions in Machine Translation

In recent times, there has been definitive progress in the field of NLP,...
research
02/25/2019

Lost in Machine Translation: A Method to Reduce Meaning Loss

A desideratum of high-quality translation systems is that they preserve ...
research
05/27/2023

Augmenting Large Language Model Translators via Translation Memories

Using translation memories (TMs) as prompts is a promising approach to i...