A Self-Refinement Strategy for Noise Reduction in Grammatical Error Correction

10/07/2020
by Masato Mita, et al.

Existing approaches to grammatical error correction (GEC) rely largely on supervised learning with manually created GEC datasets. However, little attention has been paid to verifying and ensuring the quality of these datasets, or to how lower-quality data might affect GEC performance. We found a non-negligible amount of "noise" in these datasets: errors that were inappropriately edited or left uncorrected. To address this, we designed a self-refinement method whose key idea is to denoise the datasets by leveraging the prediction consistency of existing models; this method outperformed strong denoising baselines. We further applied task-specific techniques and achieved state-of-the-art performance on the CoNLL-2014, JFLEG, and BEA-2019 benchmarks. We then analyzed the effect of the proposed denoising method and found that it improves the coverage of corrections and facilitates fluency edits, which are reflected in higher recall and overall performance.
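The abstract does not spell out the refinement procedure, so the following is only a minimal sketch of one possible reading of "prediction consistency": a reference correction is replaced when several existing correctors all produce the same alternative. The names `refine_targets`, `Corrector`, `min_agreement`, and the toy correctors are hypothetical and chosen for illustration; the agreement rule is an assumption, not the procedure described in the paper.

```python
from collections import Counter
from typing import Callable, Iterable, List, Tuple

# Hypothetical interface: a corrector maps a source sentence to a corrected one.
Corrector = Callable[[str], str]


def refine_targets(
    pairs: Iterable[Tuple[str, str]],
    correctors: List[Corrector],
    min_agreement: int,
) -> List[Tuple[str, str]]:
    """Replace a reference target when enough correctors agree on another output.

    Assumption (not from the paper): agreement among correctors is used as the
    consistency signal; otherwise the original target is kept unchanged.
    """
    if not correctors:
        raise ValueError("at least one corrector is required")
    refined = []
    for source, target in pairs:
        # Count how many correctors produce each distinct output.
        predictions = Counter(model(source) for model in correctors)
        best, votes = predictions.most_common(1)[0]
        if votes >= min_agreement and best != target:
            refined.append((source, best))    # consistent prediction wins
        else:
            refined.append((source, target))  # keep the original reference
    return refined


# Toy correctors standing in for trained GEC models (illustrative only).
def toy_a(s: str) -> str:
    return s.replace("have went", "have gone")


def toy_b(s: str) -> str:
    return s.replace("have went", "have gone").replace("a apple", "an apple")


def toy_c(s: str) -> str:
    return s.replace("have went", "have gone")


noisy_pairs = [
    ("I have went home .", "I have went home ."),  # error left uncorrected
    ("She ate a apple .", "She ate an apple ."),   # reference already correct
]
refined = refine_targets(noisy_pairs, [toy_a, toy_b, toy_c], min_agreement=3)
```

Requiring unanimous agreement (`min_agreement=3` here) keeps the second reference intact even though two of the toy correctors miss the error; a looser threshold would overwrite a good reference with a worse prediction, which is the main risk of this kind of rule.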


