CUNI System for the Building Educational Applications 2019 Shared Task: Grammatical Error Correction

09/12/2019
by   Jakub Náplava, et al.
0

In this paper, we describe our systems submitted to the Building Educational Applications (BEA) 2019 Shared Task (Bryant et al., 2019). We participated in all three tracks. Our models are NMT systems based on the Transformer model, which we improve by incorporating several enhancements: applying dropout to whole source and target words, weighting target subwords, averaging model checkpoints, and using the trained model iteratively for correcting the intermediate translations. The system in the Restricted Track is trained on the provided corpora with oversampled "cleaner" sentences and reaches 59.39 F0.5 score on the test set. The system in the Low-Resource Track is trained from Wikipedia revision histories and reaches 44.13 F0.5 score. Finally, we finetune the system from the Low-Resource Track on restricted data and achieve 64.55 F0.5 score, placing third in the Unrestricted Track.

READ FULL TEXT

page 6

page 7

research
07/23/2019

Towards Unsupervised Grammatical Error Correction using Statistical Machine Translation with Synthetic Comparable Corpus

We introduce unsupervised techniques based on phrase-based statistical m...
research
07/02/2019

A Neural Grammatical Error Correction System Built On Better Pre-training and Sequential Transfer Learning

Grammatical error correction can be viewed as a low-resource sequence-to...
research
06/29/2019

The CUED's Grammatical Error Correction Systems for BEA-2019

We describe two entries from the Cambridge University Engineering Depart...
research
06/13/2023

NAVER LABS Europe's Multilingual Speech Translation Systems for the IWSLT 2023 Low-Resource Track

This paper presents NAVER LABS Europe's systems for Tamasheq-French and ...
research
03/23/2018

Leveraging translations for speech transcription in low-resource settings

Recently proposed data collection frameworks for endangered language doc...
research
06/10/2019

Learning to combine Grammatical Error Corrections

The field of Grammatical Error Correction (GEC) has produced various sys...
research
01/13/2021

Uzbek Cyrillic-Latin-Cyrillic Machine Transliteration

In this paper, we introduce a data-driven approach to transliterating Uz...

Please sign up or login with your details

Forgot password? Click here to reset