Bootstrapping NLP tools across low-resourced African languages: an overview and prospects

10/21/2022
by   C. Maria Keet, et al.
0

Computing and Internet access are substantially growing markets in Southern Africa, which brings with it increasing demands for local content and tools in indigenous African languages. Since most of those languages are low-resourced, efforts have gone into the notion of bootstrapping tools for one African language from another. This paper provides an overview of these efforts for Niger-Congo B (`Bantu') languages. Bootstrapping grammars for geographically distant languages has been shown to still have positive outcomes for morphology and rules or grammar-based natural language generation. Bootstrapping with data-driven approaches to NLP tasks is difficult to use meaningfully regardless geographic proximity, which is largely due to lexical diversity due to both orthography and vocabulary. Cladistic approaches in comparative linguistics may inform bootstrapping strategies and similarity measures might serve as proxy for bootstrapping potential as well, with both fertile ground for further research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/10/2016

An assessment of orthographic similarity measures for several African languages

Natural Language Interfaces and tools such as spellcheckers and Web sear...
research
03/18/2022

Challenges and Strategies in Cross-Cultural NLP

Various efforts in the Natural Language Processing (NLP) community have ...
research
03/17/2022

Dim Wihl Gat Tun: The Case for Linguistic Expertise in NLP for Underdocumented Languages

Recent progress in NLP is driven by pretrained models leveraging massive...
research
08/11/2021

Ensuring the Inclusive Use of Natural Language Processing in the Global Response to COVID-19

Natural language processing (NLP) plays a significant role in tools for ...
research
03/29/2021

NLP for Ghanaian Languages

NLP Ghana is an open-source non-profit organization aiming to advance th...
research
03/23/2022

Computational historical linguistics and language diversity in South Asia

South Asia is home to a plethora of languages, many of which severely la...
research
08/14/2016

Proceedings of the LexSem+Logics Workshop 2016

Lexical semantics continues to play an important role in driving researc...

Please sign up or login with your details

Forgot password? Click here to reset