DeepAI
Log In Sign Up

Automatic Language Identification System for Hindi and Magahi

04/13/2018
by   Priya Rani, et al.
0

Language identification has become a prerequisite for all kinds of automated text processing systems. In this paper, we present a rule-based language identifier tool for two closely related Indo-Aryan languages: Hindi and Magahi. This system has currently achieved an accuracy of approx 86.34 improve this in the future. Automatic identification of languages will be significant in the accuracy of output of Web Crawlers.

READ FULL TEXT

page 1

page 2

page 3

page 4

03/26/2018

Automatic Identification of Closely-related Indian Languages: Resources and Experiments

In this paper, we discuss an attempt to develop an automatic language id...
02/11/2021

A reproduction of Apple's bi-directional LSTM models for language identification in short strings

Language Identification is the task of identifying a document's language...
10/21/2022

AfroLID: A Neural Language Identification Tool for African Languages

Language identification (LID) is a crucial precursor for NLP, especially...
04/03/2021

From n-grams to trees in Lindenmayer systems

In this paper we present two approaches to Lindenmayer systems: the rule...
01/13/2017

LIDE: Language Identification from Text Documents

The increase in the use of microblogging came along with the rapid growt...
11/29/2018

Tuplemax Loss for Language Identification

In many scenarios of a language identification task, the user will speci...
02/24/2021

Automatic Meter Classification of Kurdish Poems

Most of the classic texts in Kurdish literature are poems. Knowing the m...