Masakhane – Machine Translation For Africa

03/13/2020
by Iroro Orife, et al.

Africa has over 2000 languages. Despite this, African languages account for only a small portion of the available resources and publications in Natural Language Processing (NLP). This is due to multiple factors, including a lack of focus from government and funding, limited discoverability, a lack of community, sheer language complexity, difficulty in reproducing papers, and the absence of benchmarks for comparing techniques. To begin to address these problems, MASAKHANE was founded: an open-source, continent-wide, distributed, online research effort for machine translation for African languages. In this paper, we discuss our methodology for building the community and spurring research from the African continent, and we outline the community's success in addressing the identified problems affecting African NLP.
