Lanfrica: A Participatory Approach to Documenting Machine Translation Research on African Languages

08/03/2020
by Chris C. Emezue, et al.

Over the years, there have been campaigns to include African languages in the growing research on machine translation (MT) in particular, and natural language processing (NLP) in general. Africa has the highest language diversity, with 1500-2000 documented languages and many more undocumented or extinct languages (Lewis, 2009; Bendor-Samuel, 2017). This makes it hard to keep track of the MT research, models and datasets that have been developed for some of them. Because the internet and social media are part of the daily lives of more than half of the world (Lin, 2020), as well as over 40 [...], online platforms can make research, benchmarks and datasets in these African languages more accessible, thereby improving the reproducibility and sharing of existing research and its results. In this paper, we introduce Lanfrica, a novel, ongoing framework that employs a participatory approach to documenting research, projects, benchmarks and datasets on African languages.

