Classification-based Quality Estimation: Small and Efficient Models for Real-world Applications

09/17/2021
by Shuo Sun, et al.

Sentence-level quality estimation (QE) of machine translation is traditionally formulated as a regression task, and the performance of QE models is typically measured by Pearson correlation with human labels. Recent QE models have achieved previously unseen levels of correlation with human judgments, but they rely on large multilingual contextualized language models whose computational cost makes them infeasible for real-world applications. In this work, we evaluate several model compression techniques for QE and find that, despite their popularity in other NLP tasks, they lead to poor performance in this regression setting. We observe that a full model parameterization is required to achieve state-of-the-art (SoTA) results in a regression task. However, we argue that this level of expressiveness over a continuous range is unnecessary given the downstream applications of QE. We show that reframing QE as a classification problem, and evaluating QE models with classification metrics, better reflects their actual performance in real-world applications.
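The contrast between the two evaluation views can be illustrated with a minimal sketch. It scores the same predictions once with Pearson correlation (the regression view) and once with F1 after thresholding (the classification view). The scores, the 0.5 threshold, and the choice of F1 are illustrative assumptions; the abstract does not state which classification metrics or cut-offs the authors use.

```python
import math

def pearson(xs, ys):
    """Pearson correlation between two equal-length score lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    std_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    std_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (std_x * std_y)

def to_labels(scores, threshold):
    """Map continuous quality scores to binary labels (1 = acceptable)."""
    return [1 if s >= threshold else 0 for s in scores]

def f1_score(gold, pred):
    """F1 for the positive class; returns 0.0 when there are no true positives."""
    tp = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 1)
    fp = sum(1 for g, p in zip(gold, pred) if g == 0 and p == 1)
    fn = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Hypothetical data: human quality scores in [0, 1] and the outputs of a
# (possibly compressed) QE model whose predictions span a narrower range.
human = [0.10, 0.35, 0.62, 0.80, 0.95, 0.20]
model = [0.30, 0.45, 0.55, 0.60, 0.70, 0.40]

THRESHOLD = 0.5  # assumed cut-off between "needs work" and "acceptable"

r = pearson(human, model)            # regression view
gold = to_labels(human, THRESHOLD)   # classification view
pred = to_labels(model, THRESHOLD)
f1 = f1_score(gold, pred)

print(f"Pearson r = {r:.3f}, F1 = {f1:.3f}")
```

In this toy data the model's scores are compressed toward the middle of the range, yet every translation falls on the same side of the threshold as its human label. A downstream decision such as "publish or send to post-editing" would therefore be unaffected by the loss of fine-grained score resolution.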
