Inherent Biases in Reference-based Evaluation for Grammatical Error Correction and Text Simplification

04/30/2018
by Leshem Choshen, et al.

The prevalent use of too few references for evaluating text-to-text generation is known to bias estimates of output quality (henceforth, low coverage bias or LCB). This paper shows that LCB in Grammatical Error Correction (GEC) evaluation cannot be overcome by re-scaling or by increasing the number of references within any feasible range, contrary to previous suggestions. This is due to the long-tailed distribution of valid corrections for a sentence. Concretely, we show that LCB incentivizes GEC systems to avoid making corrections even when they can generate a valid one. Consequently, existing systems obtain performance comparable or superior to that of humans by making few but targeted changes to the input. Similar effects on Text Simplification further support our claims.
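The link between a long-tailed distribution of valid corrections and LCB can be illustrated with a small calculation. The sketch below is not the paper's estimation procedure. It assumes, purely for illustration, that the valid corrections of one sentence follow a Zipf-like distribution, and that both the references and the evaluated correction are drawn independently from it. The function names, the number of valid corrections (1000), and the exponent are hypothetical choices.

```python
def zipf_distribution(num_corrections, exponent=1.0):
    """Zipf-like probabilities over valid corrections (an assumed shape)."""
    weights = [1.0 / (rank ** exponent) for rank in range(1, num_corrections + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def expected_coverage(probs, num_references):
    """Probability that a valid correction drawn from `probs` exactly matches
    at least one of `num_references` references drawn i.i.d. from `probs`."""
    return sum(p * (1.0 - (1.0 - p) ** num_references) for p in probs)


if __name__ == "__main__":
    # Hypothetical setting: 1000 valid corrections for a single sentence.
    probs = zipf_distribution(1000)
    for m in (1, 2, 5, 10, 20):
        print(f"references={m:2d}  coverage={expected_coverage(probs, m):.3f}")
```

Under these assumptions coverage rises with the number of references but stays below 1, so a valid correction can still be scored as an error. That is the incentive toward under-correction described in the abstract.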


research
04/11/2018

Reference-less Measure of Faithfulness for Grammatical Error Correction

We propose USim, a semantic measure for Grammatical Error Correction (G...
research
05/18/2023

CLEME: Debiasing Multi-reference Evaluation for Grammatical Error Correction

It is intractable to evaluate the performance of Grammatical Error Corre...
research
01/23/2019

Context-Sensitive Malicious Spelling Error Correction

Misspelled words of the malicious kind work by changing specific keyword...
research
10/23/2022

Focus Is What You Need For Chinese Grammatical Error Correction

Chinese Grammatical Error Correction (CGEC) aims to automatically detect...
research
08/17/2023

Evaluation of really good grammatical error correction

Although rarely stated, in practice, Grammatical Error Correction (GEC) ...
research
08/04/2020

An improved Bayesian TRIE based model for SMS text normalization

Normalization of SMS text, commonly known as texting language, is being ...
research
05/23/2022

Towards Automated Document Revision: Grammatical Error Correction, Fluency Edits, and Beyond

Natural language processing technology has rapidly improved automated gr...
