A New Tool for Efficiently Generating Quality Estimation Datasets

11/01/2021
by Sugyeong Eo, et al.

Building data for quality estimation (QE) training is expensive and requires significant human labor. In this study, we take a data-centric approach to QE and propose a fully automatic pseudo-QE dataset generation tool that generates QE datasets from only a monolingual or parallel corpus as input. As a result, QE performance can be improved through data augmentation, and the applicability of QE can be extended to multiple language pairs. Further, we intend to publicly release this user-friendly QE dataset generation tool, as we believe it provides the community with a new, inexpensive method for developing QE datasets.

research
11/24/2021

A Self-Supervised Automatic Post-Editing Data Generation Tool

Data building for automatic post-editing (APE) requires extensive and ex...
research
10/30/2021

How should human translation coexist with NMT? Efficient tool for building high quality parallel corpus

This paper proposes a tool for efficiently constructing high-quality par...
research
06/06/2023

"A Little is Enough": Few-Shot Quality Estimation based Corpus Filtering improves Machine Translation

Quality Estimation (QE) is the task of evaluating the quality of a trans...
research
11/23/2021

AutoDC: Automated data-centric processing

AutoML (automated machine learning) has been extensively developed in th...
research
05/17/2023

Kitana: Efficient Data Augmentation Search for AutoML

AutoML services provide a way for non-expert users to benefit from high-...
research
06/25/2022

ConcreteGraph: A Data Augmentation Method Leveraging the Properties of Concept Relatedness Estimation

The concept relatedness estimation (CRE) task is to determine whether tw...
research
03/10/2021

Majority Voting with Bidirectional Pre-translation For Bitext Retrieval

Obtaining high-quality parallel corpora is of paramount importance for t...
