Simple Automatic Post-editing for Arabic-Japanese Machine Translation

07/14/2019
by Ella Noll, et al.

A common bottleneck in developing machine translation (MT) systems for some language pairs is the lack of direct parallel translation data sets, both in general and in specific domains. Alternative solutions such as zero-shot models or pivoting techniques provide a strong baseline, but they often perform below systems built for better-supported language pairs. In this paper, we focus on Arabic-Japanese machine translation, a less studied language pair, and we work with a unique parallel corpus of Arabic news articles that were manually translated into Japanese. We use this parallel corpus to adapt a state-of-the-art domain/genre-agnostic neural MT system via a simple automatic post-editing technique. Our results and detailed analysis suggest that this approach is quite viable for less supported language pairs in specific domains.

research
04/25/2021

Automatic Post-Editing for Translating Chinese Novels to Vietnamese

Automatic post-editing (APE) is an important remedy for reducing errors ...
research
01/09/2023

Automatic Standardization of Arabic Dialects for Machine Translation

Based on an annotated multimedia corpus, television series Marāyā 2013, ...
research
10/03/2016

An Arabic-Hebrew parallel corpus of TED talks

We describe an Arabic-Hebrew parallel corpus of TED talks built upon WIT...
research
11/08/2019

Neural Arabic Text Diacritization: State of the Art Results and a Novel Approach for Machine Translation

In this work, we present several deep learning models for the automatic ...
research
12/18/2017

Low Resourced Machine Translation via Morpho-syntactic Modeling: The Case of Dialectal Arabic

We present the second ever evaluated Arabic dialect-to-dialect machine t...
research
06/09/2020

An Augmented Translation Technique for low Resource language pair: Sanskrit to Hindi translation

Neural Machine Translation (NMT) is an ongoing technique for Machine Tra...
research
11/24/2021

A Self-Supervised Automatic Post-Editing Data Generation Tool

Data building for automatic post-editing (APE) requires extensive and ex...
