Benchmarking Neural Machine Translation for Southern African Languages

06/17/2019
by   Laura Martinus, et al.

Unlike major Western languages, most African languages are very low-resourced. Furthermore, the resources that do exist are often scattered and difficult to discover and obtain. As a result, the data and code for existing research have rarely been shared. This has led to a struggle to reproduce reported results, and few publicly available benchmarks for African machine translation models exist. To start to address these problems, we trained neural machine translation models for 5 Southern African languages on publicly available datasets. Code is provided for training the models and evaluating them on a newly released evaluation set, with the aim of spurring future research in the field for Southern African languages.


