FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation

10/01/2022
by   Parker Riley, et al.
0

We present FRMT, a new dataset and evaluation benchmark for Few-shot Region-aware Machine Translation, a type of style-targeted translation. The dataset consists of professional translations from English into two regional variants each of Portuguese and Mandarin Chinese. Source documents are selected to enable detailed analysis of phenomena of interest, including lexically distinct terms and distractor terms. We explore automatic evaluation metrics for FRMT and validate their correlation with expert human evaluation across both region-matched and mismatched rating scenarios. Finally, we present a number of baseline models for this task, and offer guidelines for how researchers can train, evaluate, and compare their own models. Our dataset and evaluation code are publicly available: https://bit.ly/frmt-task

READ FULL TEXT

page 8

page 9

research
02/19/2022

PETCI: A Parallel English Translation Dataset of Chinese Idioms

Idioms are an important language phenomenon in Chinese, but idiom transl...
research
07/21/2023

Incorporating Human Translator Style into English-Turkish Literary Machine Translation

Although machine translation systems are mostly designed to serve in the...
research
10/04/2017

Discourse Structure in Machine Translation Evaluation

In this article, we explore the potential of using sentence-level discou...
research
08/21/2018

Has Machine Translation Achieved Human Parity? A Case for Document-level Evaluation

Recent research suggests that neural machine translation achieves parity...
research
04/03/2020

A Set of Recommendations for Assessing Human-Machine Parity in Language Translation

The quality of machine translation has increased remarkably over the pas...
research
08/10/2015

Improve the Evaluation of Fluency Using Entropy for Machine Translation Evaluation Metrics

The widely-used automatic evaluation metrics cannot adequately reflect t...
research
02/10/2022

Identifying Weaknesses in Machine Translation Metrics Through Minimum Bayes Risk Decoding: A Case Study for COMET

Neural metrics have achieved impressive correlation with human judgement...

Please sign up or login with your details

Forgot password? Click here to reset