Language Identification of Devanagari Poems

12/30/2020
by Priyankit Acharya, et al.

Language identification is an important part of many text processing pipelines, and extensive research has been done in this field. This paper proposes a procedure for automatic language identification of poems, intended for poem analysis tasks, covering 10 Devanagari-based languages of India: Angika, Awadhi, Braj, Bhojpuri, Chhattisgarhi, Garhwali, Haryanvi, Hindi, Magahi, and Maithili. We collated corpora of poems of varying length and studied the similarity of poems among the 10 languages at the lexical level. Finally, we apply and evaluate various language identification systems based on supervised machine learning and deep learning techniques.
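
As an illustration of what a lexical-level similarity study can look like, the sketch below computes the Jaccard overlap between the vocabularies of two small corpora. The abstract does not state which similarity measure the authors used, so Jaccard overlap, whitespace tokenization, the function names, and the toy strings (placeholders, not real poems from the collated corpora) are all assumptions made for this example.

```python
from itertools import combinations


def vocabulary(poems):
    """Collect the set of whitespace-separated tokens across a list of poems."""
    return {token for poem in poems for token in poem.split()}


def jaccard(a, b):
    """Jaccard overlap of two token sets: |A & B| / |A | B| (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def pairwise_similarity(corpora):
    """Map each unordered language pair (sorted by name) to its vocabulary overlap."""
    vocabs = {lang: vocabulary(poems) for lang, poems in corpora.items()}
    return {
        (x, y): jaccard(vocabs[x], vocabs[y])
        for x, y in combinations(sorted(vocabs), 2)
    }


# Hypothetical toy data: placeholder lines in Devanagari script, labelled
# with made-up corpus names rather than any of the 10 languages in the paper.
toy_corpora = {
    "lang_a": ["घर में दीप जले"],
    "lang_b": ["घर में दियना बरे"],
}

similarities = pairwise_similarity(toy_corpora)
for pair, score in similarities.items():
    print(pair, round(score, 3))
```

With real corpora, a matrix of such scores would show which language pairs share the most vocabulary and are therefore likely to be hardest for a classifier to separate.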


