DATScore: Evaluating Translation with Data Augmented Translations

10/12/2022
by   Moussa Kamal Eddine, et al.
0

The rapid development of large pretrained language models has revolutionized not only the field of Natural Language Generation (NLG) but also its evaluation. Inspired by the recent work of BARTScore: a metric leveraging the BART language model to evaluate the quality of generated text from various aspects, we introduce DATScore. DATScore uses data augmentation techniques to improve the evaluation of machine translation. Our main finding is that introducing data augmented translations of the source and reference texts is greatly helpful in evaluating the quality of the generated translation. We also propose two novel score averaging and term weighting strategies to improve the original score computing process of BARTScore. Experimental results on WMT show that DATScore correlates better with human meta-evaluations than the other recent state-of-the-art metrics, especially for low-resource languages. Ablation studies demonstrate the value added by our new scoring strategies. Moreover, we report in our extended experiments the performance of DATScore on 3 NLG tasks other than translation.

READ FULL TEXT
research
02/18/2023

How Good Are GPT Models at Machine Translation? A Comprehensive Evaluation

Generative Pre-trained Transformer (GPT) models have shown remarkable ca...
research
04/22/2020

DeepSubQE: Quality estimation for subtitle translations

Quality estimation (QE) for tasks involving language data is hard owing ...
research
06/06/2023

Iterative Translation Refinement with Large Language Models

Large language models have shown surprising performances in understandin...
research
05/23/2023

FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation

Evaluating the factuality of long-form text generated by large language ...
research
07/16/2019

Quality-aware skill translation models for expert finding on StackOverflow

StackOverflow has become an emerging resource for talent recognition in ...
research
07/28/2023

Multilingual Tourist Assistance using ChatGPT: Comparing Capabilities in Hindi, Telugu, and Kannada

This research investigates the effectiveness of ChatGPT, an AI language ...

Please sign up or login with your details

Forgot password? Click here to reset