Are Pre-trained Language Models Useful for Model Ensemble in Chinese Grammatical Error Correction?

05/24/2023
by Chenming Tang, et al.

Model ensemble has been widely used in Grammatical Error Correction (GEC) to boost model performance. We hypothesize that model ensemble based on the perplexity (PPL) computed by pre-trained language models (PLMs) should benefit the GEC system. To this end, we explore several ensemble strategies based on strong PLMs with four sophisticated single models. However, performance does not improve after the PLM-based ensemble; it even gets worse. This surprising result led us to analyze the data in detail and yielded some insights on GEC. The human references of correct sentences in the test data are far from sufficient, and the gap between a correct sentence and an idiomatic one is worth our attention. Moreover, the PLM-based ensemble strategies provide an effective way to extend and improve GEC benchmark data. Our source code is available at https://github.com/JamyDon/PLM-based-CGEC-Model-Ensemble.
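
To make the idea of a PPL-based ensemble concrete, the following is a minimal sketch of one simple sentence-level reading: each single model proposes a corrected sentence, a language model scores every proposal, and the proposal with the lowest perplexity is kept. It is an illustration under stated assumptions, not the authors' implementation; the abstract does not specify the individual ensemble strategies. The names `perplexity`, `select_by_ppl`, and `make_toy_scorer` are hypothetical, and a smoothed character-unigram model stands in for a real PLM so that the example runs without external dependencies.

```python
import math
from typing import Callable, Dict, List, Sequence


def perplexity(token_log_probs: Sequence[float]) -> float:
    """PPL = exp(-(1/N) * sum(log p_i)), using natural-log token probabilities."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


def select_by_ppl(
    candidates: Sequence[str],
    score_fn: Callable[[str], List[float]],
) -> str:
    """Return the candidate whose per-token log-probs give the lowest PPL.

    score_fn is an assumption: any callable mapping a sentence to a list of
    per-token log-probabilities (in practice, a wrapper around a PLM).
    """
    return min(candidates, key=lambda s: perplexity(score_fn(s)))


def make_toy_scorer(corpus: str) -> Callable[[str], List[float]]:
    """Hypothetical stand-in for a PLM: add-one smoothed character unigrams."""
    counts: Dict[str, int] = {}
    for ch in corpus:
        counts[ch] = counts.get(ch, 0) + 1
    total = len(corpus)
    vocab = len(counts) + 1  # one extra slot for unseen characters

    def score(sentence: str) -> List[float]:
        return [
            math.log((counts.get(ch, 0) + 1) / (total + vocab))
            for ch in sentence
        ]

    return score


# Usage: choose between the outputs of two hypothetical single models.
scorer = make_toy_scorer("aaaab")
best = select_by_ppl(["aaa", "bbb"], scorer)
```

Such a selector prefers whichever candidate the language model finds most fluent. That is consistent with the distinction the abstract draws between a correct sentence and an idiomatic one: low perplexity rewards idiomatic phrasing, which need not match the human references.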


