Automatic Language Identification System for Hindi and Magahi

04/13/2018
by   Priya Rani, et al.
0

Language identification has become a prerequisite for all kinds of automated text processing systems. In this paper, we present a rule-based language identifier tool for two closely related Indo-Aryan languages: Hindi and Magahi. This system has currently achieved an accuracy of approx 86.34 improve this in the future. Automatic identification of languages will be significant in the accuracy of output of Web Crawlers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/26/2018

Automatic Identification of Closely-related Indian Languages: Resources and Experiments

In this paper, we discuss an attempt to develop an automatic language id...
research
02/11/2021

A reproduction of Apple's bi-directional LSTM models for language identification in short strings

Language Identification is the task of identifying a document's language...
research
10/21/2022

AfroLID: A Neural Language Identification Tool for African Languages

Language identification (LID) is a crucial precursor for NLP, especially...
research
04/03/2021

From n-grams to trees in Lindenmayer systems

In this paper we present two approaches to Lindenmayer systems: the rule...
research
01/13/2017

LIDE: Language Identification from Text Documents

The increase in the use of microblogging came along with the rapid growt...
research
02/24/2021

Automatic Meter Classification of Kurdish Poems

Most of the classic texts in Kurdish literature are poems. Knowing the m...
research
06/09/2022

Language Identification for Austronesian Languages

This paper provides language identification models for low- and under-re...

Please sign up or login with your details

Forgot password? Click here to reset