JFLEG: A Fluency Corpus and Benchmark for Grammatical Error Correction

02/14/2017
by   Courtney Napoles, et al.
0

We present a new parallel corpus, JHU FLuency-Extended GUG corpus (JFLEG) for developing and evaluating grammatical error correction (GEC). Unlike other corpora, it represents a broad range of language proficiency levels and uses holistic fluency edits to not only correct grammatical errors but also make the original text more native sounding. We describe the types of corrections made and benchmark four leading GEC systems on this corpus, identifying specific areas in which they do well and how they can improve. JFLEG fulfills the need for a new gold standard to properly assess the current state of GEC.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/31/2021

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

We present a corpus professionally annotated for grammatical error corre...
research
05/29/2023

Byte-Level Grammatical Error Correction Using Synthetic and Curated Corpora

Grammatical error correction (GEC) is the task of correcting typos, spel...
research
04/05/2019

Cross-Corpora Evaluation and Analysis of Grammatical Error Correction Models --- Is Single-Corpus Evaluation Enough?

This study explores the necessity of performing cross-corpora evaluation...
research
02/12/2023

An Extended Sequence Tagging Vocabulary for Grammatical Error Correction

We extend a current sequence-tagging approach to Grammatical Error Corre...
research
10/15/2020

Grammatical Error Correction in Low Error Density Domains: A New Benchmark and Analyses

Evaluation of grammatical error correction (GEC) systems has primarily f...
research
07/12/2023

Misclassification in Automated Content Analysis Causes Bias in Regression. Can We Fix It? Yes We Can!

Automated classifiers (ACs), often built via supervised machine learning...
research
05/26/2019

Evaluation of basic modules for isolated spelling error correction in Polish texts

Spelling error correction is an important problem in natural language pr...

Please sign up or login with your details

Forgot password? Click here to reset