Phrase-based Machine Translation is State-of-the-Art for Automatic Grammatical Error Correction

05/20/2016
by Marcin Junczys-Dowmunt, et al.

In this work, we study parameter tuning towards the M^2 metric, the standard metric for automatic grammatical error correction (GEC) tasks. After implementing M^2 as a scorer in the Moses tuning framework, we investigate interactions of dense and sparse features, different optimizers, and tuning strategies for the CoNLL-2014 shared task. We notice erratic behavior when optimizing sparse feature weights with M^2 and offer partial solutions. We find that a bare-bones phrase-based SMT setup with task-specific parameter tuning outperforms all previously published results for the CoNLL-2014 test set by a large margin (46.37% M^2) while being trained on the same publicly available data. Our newly introduced dense and sparse features widen that gap, and we improve the state of the art to 49.49% M^2.
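
For readers unfamiliar with the metric, the following is a minimal sketch of the F_0.5 score that M^2 reports, assuming the system and gold edits have already been extracted. It is not the M^2 scorer itself: the real scorer also searches for the system edit sequence that best overlaps the gold annotation, and that step is omitted here. The `Edit` tuple layout, the function name `f_beta`, and the handling of empty edit sets are assumptions made for illustration.

```python
from typing import Set, Tuple

# Hypothetical edit representation: (start_token, end_token, replacement).
# This layout is an assumption for the sketch, not the M^2 file format.
Edit = Tuple[int, int, str]


def f_beta(system_edits: Set[Edit], gold_edits: Set[Edit], beta: float = 0.5) -> float:
    """Return F_beta over edits; beta = 0.5 weights precision twice as much as recall."""
    matched = len(system_edits & gold_edits)
    # Assumed convention: an empty edit set yields a score of 1.0 on that side.
    precision = matched / len(system_edits) if system_edits else 1.0
    recall = matched / len(gold_edits) if gold_edits else 1.0
    if precision + recall == 0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Toy example: one of two proposed edits matches one of three gold edits.
system = {(1, 2, "goes"), (4, 5, "the")}
gold = {(1, 2, "goes"), (6, 7, "an"), (8, 9, "")}
score = f_beta(system, gold)
```

In the toy example precision is 1/2 and recall is 1/3. Because F_0.5 favors precision, a tuning objective built on it rewards systems that propose fewer but more reliable corrections.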
