Personalizing Grammatical Error Correction: Adaptation to Proficiency Level and L1

06/04/2020
by Maria Nădejde, et al.

Grammatical error correction (GEC) systems have become ubiquitous in a variety of software applications and have started to approach human-level performance on some datasets. However, very little is known about how to efficiently personalize these systems to a user's characteristics, such as proficiency level and first language (L1), or to emerging domains of text. We present the first results on adapting a general-purpose neural GEC system to both the proficiency level and the first language of a writer, using only a few thousand annotated sentences. Our study is the broadest of its kind, covering five proficiency levels and twelve different languages, and comparing three adaptation scenarios: adapting to the proficiency level only, to the first language only, or to both aspects simultaneously. We show that adapting to both aspects simultaneously achieves the largest performance improvement (3.6 F0.5) relative to a strong baseline.
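For readers unfamiliar with the F0.5 metric reported above, the following is a minimal sketch of the standard F-beta formula, F_beta = (1 + beta^2) * P * R / (beta^2 * P + R), with beta = 0.5 so that precision counts more than recall. The function name `f_beta` and the edit-level counts are illustrative assumptions and are not taken from the paper; returning 0.0 when there are no edits at all is also an assumed convention, and actual GEC scorers may handle that case differently.

```python
def f_beta(tp: int, fp: int, fn: int, beta: float = 0.5) -> float:
    """Compute F-beta from counts of correct (tp), spurious (fp), and missed (fn) edits.

    Hypothetical helper for illustration; beta < 1 weights precision above recall.
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision == 0.0 and recall == 0.0:
        # Assumed convention: score 0.0 when nothing was proposed or matched.
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Illustrative counts (not from the paper): 6 correct edits, 2 spurious, 6 missed.
score = f_beta(tp=6, fp=2, fn=6)
print(f"F0.5 = {score:.4f}")
```

With these made-up counts, precision is 0.75 and recall is 0.5, so the F0.5 score sits closer to precision than the balanced F1 would.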

research
03/31/2021

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

We present a corpus professionally annotated for grammatical error corre...
research
01/10/2020

Towards Minimal Supervision BERT-based Grammar Error Correction

Current grammatical error correction (GEC) models typically consider the...
research
01/29/2021

Few-Shot Domain Adaptation for Grammatical Error Correction via Meta-Learning

Most existing Grammatical Error Correction (GEC) methods based on sequen...
research
06/27/2023

Evaluating GPT-3.5 and GPT-4 on Grammatical Error Correction for Brazilian Portuguese

We investigate the effectiveness of GPT-3.5 and GPT-4, two large languag...
research
02/12/2023

An Extended Sequence Tagging Vocabulary for Grammatical Error Correction

We extend a current sequence-tagging approach to Grammatical Error Corre...
research
10/15/2020

Grammatical Error Correction in Low Error Density Domains: A New Benchmark and Analyses

Evaluation of grammatical error correction (GEC) systems has primarily f...
research
11/15/2022

Adaptation Approaches for Nearest Neighbor Language Models

Semi-parametric Nearest Neighbor Language Models (kNN-LMs) have produced...
