How can NLP Help Revitalize Endangered Languages? A Case Study and Roadmap for the Cherokee Language

04/25/2022
by   Shiyue Zhang, et al.
1

More than 43 language loss currently occurs at an accelerated rate because of globalization and neocolonialism. Saving and revitalizing endangered languages has become very important for maintaining the cultural diversity on our planet. In this work, we focus on discussing how NLP can help revitalize endangered languages. We first suggest three principles that may help NLP practitioners to foster mutual understanding and collaboration with language communities, and we discuss three ways in which NLP can potentially assist in language education. We then take Cherokee, a severely-endangered Native American language, as a case study. After reviewing the language's history, linguistic features, and existing resources, we (in collaboration with Cherokee community members) arrive at a few meaningful ways NLP practitioners can collaborate with community partners. We suggest two approaches to enrich the Cherokee language's resources with machine-in-the-loop processing, and discuss several NLP tools that people from the Cherokee community have shown interest in. We hope that our work serves not only to inform the NLP community about Cherokee, but also to provide inspiration for future work on endangered languages in general. Our code and data will be open-sourced at https://github.com/ZhangShiyue/RevitalizeCherokee

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/21/2022

NusaCrowd: A Call for Open and Reproducible NLP Research in Indonesian Languages

At the center of the underlying issues that halt Indonesian natural lang...
research
03/17/2022

Dim Wihl Gat Tun: The Case for Linguistic Expertise in NLP for Underdocumented Languages

Recent progress in NLP is driven by pretrained models leveraging massive...
research
09/20/2022

NLP for Language Varieties of Italy: Challenges and the Path Forward

Italy is characterized by a one-of-a-kind linguistic diversity landscape...
research
09/29/2020

Utility is in the Eye of the User: A Critique of NLP Leaderboards

Benchmarks such as GLUE have helped drive advances in NLP by incentivizi...
research
03/02/2022

Mukayese: Turkish NLP Strikes Back

Having sufficient resources for language X lifts it from the under-resou...
research
03/13/2020

Masakhane – Machine Translation For Africa

Africa has over 2000 languages. Despite this, African languages account ...
research
10/02/2022

Community Learning: Understanding A Community Through NLP for Positive Impact

A post-pandemic world resulted in economic upheaval, particularly for th...

Please sign up or login with your details

Forgot password? Click here to reset