SYSTRAN's Pure Neural Machine Translation Systems

10/18/2016
by   Josep Crego, et al.
0

Since the first online demonstration of Neural Machine Translation (NMT) by LISA, NMT development has recently moved from laboratory to production systems as demonstrated by several entities announcing roll-out of NMT engines to replace their existing technologies. NMT systems have a large number of training configurations and the training process of such systems is usually very long, often a few weeks, so role of experimentation is critical and important to share. In this work, we present our approach to production-ready systems simultaneously with release of online demonstrators covering a large variety of languages (12 languages, for 32 language pairs). We explore different practical choices: an efficient and evolutive open-source framework; data preparation; network architecture; additional implemented features; tuning for production; etc. We discuss about evaluation methodology, present our first findings and we finally outline further work. Our ultimate goal is to share our expertise to build competitive production systems for "generic" translation. We aim at contributing to set up a collaborative framework to speed-up adoption of the technology, foster further research efforts and enable the delivery and adoption to/by industry of use-case specific engines integrated in real production workflows. Mastering of the technology would allow us to build translation engines suited for particular needs, outperforming current simplest/uniform systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

07/25/2017

Machine Translation at Booking.com: Journey and Lessons Learned

We describe our recently developed neural machine translation (NMT) syst...
09/18/2017

Toward a full-scale neural machine translation in production: the Booking.com use case

While some remarkable progress has been made in neural machine translati...
05/28/2018

OpenNMT: Neural Machine Translation Toolkit

OpenNMT is an open-source toolkit for neural machine translation (NMT). ...
07/12/2022

Sockeye 3: Fast Neural Machine Translation with PyTorch

Sockeye 3 is the latest version of the Sockeye toolkit for Neural Machin...
03/11/2017

Massive Exploration of Neural Machine Translation Architectures

Neural Machine Translation (NMT) has shown remarkable progress over the ...
09/01/2017

Kafka versus RabbitMQ

Publish/subscribe is a distributed interaction paradigm well adapted to ...