Algorithms for certain classes of Tamil Spelling correction

09/22/2019
by   Muthiah Annamalai, et al.

The Tamil language has an agglutinative, diglossic, alpha-syllabary structure that produces a combinatorial explosion of morphological forms, all of which have been in active use in Tamil prose and poetry from antiquity to the modern age in an unbroken chain of continuity. For language understanding and spelling correction, however, some of these forms pose a challenge because they appear as out-of-dictionary words. In this paper the authors propose algorithmic techniques that efficiently handle the specific problem of out-of-dictionary conjoined words, for example (in transliteration) [thendRalkattRu] = [thendRal] + [kattRu], where only the individual parts are present in the word list. The morphological structure of Tamil makes it necessary to rely on a synthesis-analysis approach, since dictionary lists will never be sufficient to truly capture the language. In this paper we have attempted to summarize various known algorithms for specific classes of Tamil spelling errors. We believe this collection of suggestions can improve future spelling checkers. We note that we do not cover many important techniques, such as affix removal, that are of key importance in rule-based spell checkers.
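The conjoined-word case in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm, only a brute-force two-way split under stated assumptions: the function name `split_conjoined` and the toy word list are hypothetical, and the input is the ASCII transliteration used in the abstract. For text in Tamil script the split points would need to fall on letter boundaries rather than on individual code points.

```python
def split_conjoined(word, lexicon):
    """Return every (left, right) split of `word` whose parts are both in `lexicon`.

    `lexicon` is assumed to be a set, so each membership test is O(1) on average.
    """
    return [
        (word[:i], word[i:])
        for i in range(1, len(word))  # try every interior split point
        if word[:i] in lexicon and word[i:] in lexicon
    ]


# Hypothetical toy word list holding only the two parts, not the joined form.
lexicon = {"thendRal", "kattRu"}

print(split_conjoined("thendRalkattRu", lexicon))
# [('thendRal', 'kattRu')]
```

A word that is absent from the word list but yields at least one such split can be accepted as a valid compound instead of being flagged as a misspelling.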
