Neural-based machine translation for medical text domain. Based on European Medicines Agency leaflet texts

09/29/2015
by Krzysztof Wołk, et al.

The quality of machine translation is improving rapidly. Several machine translation systems available on the web now provide reasonable, though imperfect, translations. In some specific domains, however, quality may decrease. A recently proposed approach to this problem is neural machine translation. It aims to build a single, jointly tuned neural network that maximizes translation performance, which is a very different approach from traditional statistical machine translation. Recently proposed neural machine translation models often belong to the encoder-decoder family, in which a source sentence is encoded into a fixed-length vector that is, in turn, decoded to generate a translation. The present research examines the effects of different training methods on a Polish-English machine translation system used for medical data. The European Medicines Agency parallel text corpus was used as the basis for training neural network-based and statistical translation systems. The main machine translation evaluation metrics were also used to analyze the systems. The main focus of our experiments is a comparison of these systems and the implementation of a real-time medical translator.
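The encoder-decoder idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the system described in the paper: the vocabulary sizes, the hidden size, the plain tanh recurrence, the greedy decoding, and the reuse of the end-of-sentence id as a start symbol are all assumptions made for illustration, and the weights are random rather than trained. The sketch only shows that source sentences of different lengths are folded into a vector of the same fixed size, from which target tokens are then generated.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy sizes; a real system would use far larger values.
SRC_VOCAB, TGT_VOCAB, HIDDEN = 12, 10, 8
EOS = 0  # assumed end-of-sentence id, also reused here as the start symbol

# Randomly initialised (untrained) parameters, for illustration only.
E_src = rng.normal(scale=0.1, size=(SRC_VOCAB, HIDDEN))  # source embeddings
E_tgt = rng.normal(scale=0.1, size=(TGT_VOCAB, HIDDEN))  # target embeddings
W_enc = rng.normal(scale=0.1, size=(HIDDEN, HIDDEN))     # encoder recurrence
W_dec = rng.normal(scale=0.1, size=(HIDDEN, HIDDEN))     # decoder recurrence
W_out = rng.normal(scale=0.1, size=(HIDDEN, TGT_VOCAB))  # output projection


def encode(src_ids):
    """Fold a source sentence of any length into one HIDDEN-sized vector."""
    h = np.zeros(HIDDEN)
    for tok in src_ids:
        h = np.tanh(E_src[tok] + W_enc @ h)
    return h


def decode(context, max_len=5):
    """Greedily emit target token ids, starting from the encoder's vector."""
    h, prev, out = context, EOS, []
    for _ in range(max_len):
        h = np.tanh(E_tgt[prev] + W_dec @ h)
        prev = int(np.argmax(h @ W_out))  # pick the highest-scoring token
        if prev == EOS:
            break
        out.append(prev)
    return out


short_vec = encode([3, 7])
long_vec = encode([3, 7, 1, 9, 4, 2, 11])
translation = decode(long_vec)
```

Both `short_vec` and `long_vec` have shape `(HIDDEN,)` even though the inputs differ in length, which is the fixed-length bottleneck the abstract refers to. Because the weights are untrained, the token ids in `translation` carry no meaning.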
