Predicting Perfect Quality Segments in MT Output with Fine-Tuned OpenAI LLM: Is it possible to capture editing distance patterns from historical data?

07/31/2023
by   Serge Gladkoff, et al.
0

Translation Quality Estimation (TQE) is an important step before deploying the output translation into usage. TQE is also critical in assessing machine translation (MT) and human translation (HT) quality without seeing the reference translations. In this work, we examine if the state-of-the-art large language models (LLMs) can be fine-tuned for the TQE task and their capability. We take ChatGPT as one example and approach TQE as a binary classification task. Using English to Italian, German, French, Japanese, Dutch, Portuguese, Turkish, and Chinese training corpora, our experimental results show that fine-tuned ChatGPT via its API can achieve a relatively high score on predicting translation quality, i.e. if the translation needs to be edited, but there is definitely much space to improve the accuracy. English-Italiano bilingual Abstract is available in the paper.

READ FULL TEXT

page 3

page 4

research
02/01/2023

An Evaluation of Persian-English Machine Translation Datasets with Transformers

Nowadays, many researchers are focusing their attention on the subject o...
research
09/08/2021

Ensemble Fine-tuned mBERT for Translation Quality Estimation

Quality Estimation (QE) is an important component of the machine transla...
research
06/01/2019

Learning to Transfer: Unsupervised Meta Domain Translation

Unsupervised domain translation has recently achieved impressive perform...
research
10/24/2022

Bilingual Synchronization: Restoring Translational Relationships with Editing Operations

Machine Translation (MT) is usually viewed as a one-shot process that ge...
research
11/14/2018

The ADAPT System Description for the IWSLT 2018 Basque to English Translation Task

In this paper we present the ADAPT system built for the Basque to Englis...
research
05/19/2016

Automatic TM Cleaning through MT and POS Tagging: Autodesk's Submission to the NLP4TM 2016 Shared Task

We describe a machine learning based method to identify incorrect entrie...
research
10/23/2022

Translation Word-Level Auto-Completion: What can we achieve out of the box?

Research on Machine Translation (MT) has achieved important breakthrough...

Please sign up or login with your details

Forgot password? Click here to reset