How Good Are GPT Models at Machine Translation? A Comprehensive Evaluation

02/18/2023
by Amr Hendy, et al.
Generative Pre-trained Transformer (GPT) models have shown remarkable capabilities for natural language generation, but their performance on machine translation has not been thoroughly investigated. In this paper, we present a comprehensive evaluation of GPT models for machine translation, covering the quality of different GPT models in comparison with state-of-the-art research and commercial systems, the effect of prompting strategies, robustness to domain shifts, and document-level translation. We experiment with eighteen translation directions involving high- and low-resource languages, as well as non-English-centric translations, and evaluate the performance of three GPT models: ChatGPT, GPT3.5 (text-davinci-003), and text-davinci-002. Our results show that GPT models achieve very competitive translation quality for high-resource languages, while having limited capabilities for low-resource languages. We also show that hybrid approaches, which combine GPT models with other translation systems, can further enhance translation quality. We perform comprehensive analysis and human evaluation to further understand the characteristics of GPT translations. We hope that our paper provides valuable insights for researchers and practitioners in the field and helps them better understand the potential and limitations of GPT models for translation.

research
02/27/2022

OCR Improves Machine Translation for Low-Resource Languages

We aim to investigate the performance of current OCR systems on low reso...
research
10/12/2022

DATScore: Evaluating Translation with Data Augmented Translations

The rapid development of large pretrained language models has revolution...
research
04/05/2023

Unleashing the Power of ChatGPT for Translation: An Empirical Study

The recently released ChatGPT has demonstrated surprising abilities in n...
research
08/17/2023

On the Evaluation of Neural Code Translation: Taxonomy and Benchmark

In recent years, neural code translation has gained increasing attention...
research
03/28/2023

Hallucinations in Large Multilingual Translation Models

Large-scale multilingual machine translation systems have demonstrated r...
research
07/18/2021

As Easy as 1, 2, 3: Behavioural Testing of NMT Systems for Numerical Translation

Mistranslated numbers have the potential to cause serious effects, such ...
research
10/06/2022

Toxicity in Multilingual Machine Translation at Scale

Machine Translation systems can produce different types of errors, some ...
