Collaborative construction of lexicographic and parallel datasets for African languages: first assessment

03/30/2021
by   Elvis Mboning Tchiaze, et al.

Faced with a considerable lack of resources in African languages for work in Natural Language Processing (NLP), Natural Language Understanding (NLU) and artificial intelligence, the research teams of the NTeALan association have set themselves the objective of building open-source platforms for the collaborative construction of lexicographic data in African languages. In this article, we present our first assessment after two years of collaborative construction of lexicographic resources useful for African NLP tools.


research
12/19/2022

NusaCrowd: Open Source Initiative for Indonesian NLP Resources

We present NusaCrowd, a collaborative initiative to collect and unite ex...
research
05/25/2020

MaintNet: A Collaborative Open-Source Library for Predictive Maintenance Language Resources

Maintenance record logbooks are an emerging text type in NLP. They typic...
research
07/21/2022

NusaCrowd: A Call for Open and Reproducible NLP Research in Indonesian Languages

At the center of the underlying issues that halt Indonesian natural lang...
research
03/13/2020

Masakhane – Machine Translation For Africa

Africa has over 2000 languages. Despite this, African languages account ...
research
02/01/2021

Gamified Crowdsourcing for Idiom Corpora Construction

Learning idiomatic expressions is seen as one of the most challenging st...
research
08/10/2016

An assessment of orthographic similarity measures for several African languages

Natural Language Interfaces and tools such as spellcheckers and Web sear...
research
12/11/2019

A Collaborative Ecosystem for Digital Coptic Studies

Scholarship on underresourced languages bring with them a variety of cha...
