A Multilayer Convolutional Encoder-Decoder Neural Network for Grammatical Error Correction

01/26/2018
by   Shamil Chollampatt, et al.
0

We improve automatic correction of grammatical, orthographic, and collocation errors in text using a multilayer convolutional encoder-decoder neural network. The network is initialized with embeddings that make use of character N-gram information to better suit this task. When evaluated on common benchmark test data sets (CoNLL-2014 and JFLEG), our model substantially outperforms all prior neural approaches on this task as well as strong statistical machine translation-based systems with neural and task-specific features trained on the same data. Our analysis shows the superiority of convolutional neural networks over recurrent neural networks such as long short-term memory (LSTM) networks in capturing the local context via attention, and thereby improving the coverage in correcting grammatical errors. By ensembling multiple models, and incorporating an N-gram language model and edit features via rescoring, our novel method becomes the first neural approach to outperform the current state-of-the-art statistical machine translation-based approach, both in terms of grammaticality and fluency.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/31/2016

Neural Language Correction with Character-Based Attention

Natural language correction has the potential to help language learners ...
research
10/21/2022

Is Encoder-Decoder Redundant for Neural Machine Translation?

Encoder-decoder architecture is widely adopted for sequence-to-sequence ...
research
05/21/2017

Spelling Correction as a Foreign Language

In this paper, we reformulated the spell correction problem as a machine...
research
01/08/2019

Choosing the Right Word: Using Bidirectional LSTM Tagger for Writing Support Systems

Scientific writing is difficult. It is even harder for those for whom En...
research
07/07/2017

A Nested Attention Neural Hybrid Model for Grammatical Error Correction

Grammatical error correction (GEC) systems strive to correct both global...
research
06/07/2020

NITS-VC System for VATEX Video Captioning Challenge 2020

Video captioning is process of summarising the content, event and action...
research
10/09/2019

Kernel-Based Approaches for Sequence Modeling: Connections to Neural Methods

We investigate time-dependent data analysis from the perspective of recu...

Please sign up or login with your details

Forgot password? Click here to reset