Log In Sign Up

Controlling Text Complexity in Neural Machine Translation

by   Sweta Agrawal, et al.

This work introduces a machine translation task where the output is aimed at audiences of different levels of target language proficiency. We collect a high quality dataset of news articles available in English and Spanish, written for diverse grade levels and propose a method to align segments across comparable bilingual articles. The resulting dataset makes it possible to train multi-task sequence-to-sequence models that translate Spanish into English targeted at an easier reading grade level than the original Spanish. We show that these multi-task models outperform pipeline approaches that translate and simplify text independently.


page 1

page 2

page 3

page 4


JUCBNMT at WMT2018 News Translation Task: Character Based Neural Machine Translation of Finnish to English

In the current work, we present a description of the system submitted to...

JUMT at WMT2019 News Translation Task: A Hybrid approach to Machine Translation for Lithuanian to English

In the current work, we present a description of the system submitted to...

NMTPY: A Flexible Toolkit for Advanced Neural Machine Translation Systems

In this paper, we present nmtpy, a flexible Python toolkit based on Thea...

A Case Study: Exploiting Neural Machine Translation to Translate CUDA to OpenCL

The sequence-to-sequence (seq2seq) model for neural machine translation ...

Controlling Neural Machine Translation Formality with Synthetic Supervision

This work aims to produce translations that convey source language conte...

Lost in Machine Translation: A Method to Reduce Meaning Loss

A desideratum of high-quality translation systems is that they preserve ...