Computational historical linguistics and language diversity in South Asia

03/23/2022
by Aryaman Arora, et al.

South Asia is home to a plethora of languages, many of which severely lack access to new language technologies. This linguistic diversity also results in a research environment conducive to the study of comparative, contact, and historical linguistics – fields which necessitate the gathering of extensive data from many languages. We claim that data scatteredness (rather than scarcity) is the primary obstacle in the development of South Asian language technology, and suggest that the study of language history is uniquely aligned with surmounting this obstacle. We review recent developments in and at the intersection of South Asian NLP and historical-comparative linguistics, describing our and others' current efforts in this area. We also offer new strategies towards breaking the data barrier.


