Towards Lithuanian grammatical error correction

03/18/2022
by   Lukas Stankevičius, et al.
0

Everyone wants to write beautiful and correct text, yet the lack of language skills, experience, or hasty typing can result in errors. By employing the recent advances in transformer architectures, we construct a grammatical error correction model for Lithuanian, the language rich in archaic features. We compare subword and byte-level approaches and share our best trained model, achieving F_0.5=0.92, and accompanying code, in an online open-source repository.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/27/2018

Building a Lemmatizer and a Spell-checker for Sorani Kurdish

The present paper aims at presenting a lemmatization and a word-level er...
research
07/26/2023

GrammarGPT: Exploring Open-Source LLMs for Native Chinese Grammatical Error Correction with Supervised Fine-Tuning

Grammatical error correction aims to correct ungrammatical sentences aut...
research
09/18/2023

HTEC: Human Transcription Error Correction

High-quality human transcription is essential for training and improving...
research
04/21/2016

OCR Error Correction Using Character Correction and Feature-Based Word Classification

This paper explores the use of a learned classifier for post-OCR text co...
research
08/14/2016

Numerically Grounded Language Models for Semantic Error Correction

Semantic error detection and correction is an important task for applica...
research
08/03/2021

Fast BCH Coding for Optimal Robust Image Watermarking in DCT Domain

This paper investigates a novel approach of digital image watermarking b...
research
03/13/2020

Investigating Error Injection to Enhance the Effectiveness of Mobile Text Entry Studies of Error Behaviour

During lab studies of text entry methods it is typical to observer very ...

Please sign up or login with your details

Forgot password? Click here to reset