Application of Low-resource Machine Translation Techniques to Russian-Tatar Language Pair

10/01/2019
by   Aidar Valeev, et al.
0

Neural machine translation is the current state-of-the-art in machine translation. Although it is successful in a resource-rich setting, its applicability for low-resource language pairs is still debatable. In this paper, we explore the effect of different techniques to improve machine translation quality when a parallel corpus is as small as 324 000 sentences, taking as an example previously unexplored Russian-Tatar language pair. We apply such techniques as transfer learning and semi-supervised learning to the base Transformer model, and empirically show that the resulting models improve Russian to Tatar and Tatar to Russian translation quality by +2.57 and +3.66 BLEU, respectively.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/08/2016

Transfer Learning for Low-Resource Neural Machine Translation

The encoder-decoder framework for neural machine translation (NMT) has b...
research
09/26/2017

Improving a Multi-Source Neural Machine Translation Model with Corpus Extension for Low-Resource Languages

In machine translation, we often try to collect resources to improve its...
research
01/12/2020

Urdu-English Machine Transliteration using Neural Networks

Machine translation has gained much attention in recent years. It is a s...
research
03/19/2021

Congolese Swahili Machine Translation for Humanitarian Response

In this paper we describe our efforts to make a bidirectional Congolese ...
research
04/02/2023

Semi-supervised Neural Machine Translation with Consistency Regularization for Low-Resource Languages

The advent of deep learning has led to a significant gain in machine tra...
research
11/06/2020

Semi-Supervised Low-Resource Style Transfer of Indonesian Informal to Formal Language with Iterative Forward-Translation

In its daily use, the Indonesian language is riddled with informality, t...
research
10/08/2020

Query-Key Normalization for Transformers

Low-resource language translation is a challenging but socially valuable...

Please sign up or login with your details

Forgot password? Click here to reset