Comparison of Grammatical Error Correction Using Back-Translation Models

04/16/2021
by Aomi Koyama, et al.

Grammatical error correction (GEC) suffers from a lack of sufficient parallel data. GEC studies have therefore developed various methods for generating pseudo data, which consist of pairs of grammatical sentences and artificially produced ungrammatical sentences. Currently, a mainstream approach to generating pseudo data is back-translation (BT). Most previous GEC studies using BT have employed the same architecture for both the GEC and BT models. However, GEC models have different correction tendencies depending on their architectures. Thus, in this study, we compare the correction tendencies of GEC models trained on pseudo data generated by three different BT models: Transformer, CNN, and LSTM. The results confirm that the correction tendencies for each error type differ for every BT model. Additionally, we examine the correction tendencies when using a combination of pseudo data generated by different BT models. We find that combining different BT models improves or interpolates the F_0.5 score of each error type compared with the scores obtained from single BT models with different seeds.
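For readers unfamiliar with the F_0.5 score mentioned above, the following is a minimal sketch of the standard F_beta formula, F_beta = (1 + beta^2) * P * R / (beta^2 * P + R), which with beta = 0.5 weights precision more heavily than recall. The function name `f_beta`, the edit counts in the usage lines, and the convention of returning 0.0 when there are no counts are illustrative assumptions, not taken from the paper or from any particular GEC scorer.

```python
def f_beta(tp: int, fp: int, fn: int, beta: float = 0.5) -> float:
    """Compute F_beta from true-positive, false-positive and false-negative edit counts.

    Assumption: precision, recall and the final score fall back to 0.0 when
    their denominators are zero; real GEC scorers may use another convention.
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Hypothetical counts for one error type: 6 correct edits, 2 spurious, 6 missed.
score_f05 = f_beta(tp=6, fp=2, fn=6)            # precision 0.75, recall 0.50
score_f1 = f_beta(tp=6, fp=2, fn=6, beta=1.0)   # balanced F1 for comparison
print(round(score_f05, 4), round(score_f1, 4))
```

Computing this score separately for each error type, using counts collected per type, is what makes a per-type comparison between systems possible.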

research
09/02/2019

An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction

The incorporation of pseudo data in the training of grammatical error co...
research
03/17/2022

Type-Driven Multi-Turn Corrections for Grammatical Error Correction

Grammatical Error Correction (GEC) aims to automatically detect and corr...
research
11/01/2018

Spelling Error Correction Using a Nested RNN Model and Pseudo Training Data

We propose a nested recurrent neural network (nested RNN) model for Engl...
research
04/05/2019

Cross-Corpora Evaluation and Analysis of Grammatical Error Correction Models --- Is Single-Corpus Evaluation Enough?

This study explores the necessity of performing cross-corpora evaluation...
research
09/24/2018

An Automated Approach Towards Sparse Single-Equation Cointegration Modelling

In this paper we propose the Single-equation Penalized Error Correction ...
research
06/10/2019

Learning to combine Grammatical Error Corrections

The field of Grammatical Error Correction (GEC) has produced various sys...
research
06/16/2023

Improving Audio Caption Fluency with Automatic Error Correction

Automated audio captioning (AAC) is an important cross-modality translat...
