Igbo-English Machine Translation: An Evaluation Benchmark

04/01/2020
by   Ignatius Ezeani, et al.
0

Although researchers and practitioners are pushing the boundaries and enhancing the capacities of NLP tools and methods, works on African languages are lagging. A lot of focus on well resourced languages such as English, Japanese, German, French, Russian, Mandarin Chinese etc. Over 97 world's 7000 languages, including African languages, are low resourced for NLP i.e. they have little or no data, tools, and techniques for NLP research. For instance, only 5 out of 2965, 0.19 Anthology extracted from the 5 major conferences in 2018 ACL, NAACL, EMNLP, COLING and CoNLL, are affiliated to African institutions. In this work, we discuss our effort toward building a standard machine translation benchmark dataset for Igbo, one of the 3 major Nigerian languages. Igbo is spoken by more than 50 million people globally with over 50 southeastern Nigeria. Igbo is low resourced although there have been some efforts toward developing IgboNLP such as part of speech tagging and diacritic restoration

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/07/2019

PidginUNMT: Unsupervised Neural Machine Translation from West African Pidgin to English

Over 800 languages are spoken across West Africa. Despite the obvious di...
research
05/27/2023

Enhancing Translation for Indigenous Languages: Experiments with Multilingual Models

This paper describes CIC NLP's submission to the AmericasNLP 2023 Shared...
research
03/13/2021

OkwuGbé: End-to-End Speech Recognition for Fon and Igbo

Language is inherent and compulsory for human communication. Whether exp...
research
03/13/2020

Masakhane – Machine Translation For Africa

Africa has over 2000 languages. Despite this, African languages account ...
research
01/02/2023

Statistical Machine Translation for Indic Languages

Machine Translation (MT) system generally aims at automatic representati...
research
05/01/2023

Low-Resourced Machine Translation for Senegalese Wolof Language

Natural Language Processing (NLP) research has made great advancements i...
research
05/06/2022

Bridging the Domain Gap for Stance Detection for the Zulu language

Misinformation has become a major concern in recent last years given its...

Please sign up or login with your details

Forgot password? Click here to reset