Polish-English Statistical Machine Translation of Medical Texts

09/29/2015
by Krzysztof Wołk, et al.

This research explores the effects of various training methods on a Polish-to-English statistical machine translation (SMT) system for medical texts. Various elements of the EMEA parallel text corpora from the OPUS project were used as the basis for training phrase tables and language models, and for developing, tuning, and testing the translation system. The BLEU, NIST, METEOR, RIBES, and TER metrics were used to evaluate the effects of various system and data preparations on translation results. Our experiments included systems that used POS tagging, factored phrase models, hierarchical models, syntactic taggers, and many different alignment methods. We also conducted a deep analysis of the Polish data as preparatory work for automatic data correction, such as true casing and punctuation normalization.
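The abstract names true casing and punctuation normalization as data-correction steps but does not describe how they were carried out. As a rough illustration only, the following Python sketch shows one common way such preprocessing might look: typographic punctuation is mapped to ASCII, and each word is restored to its most frequent non-sentence-initial casing. The mapping table and the function names (`normalize_punctuation`, `train_truecaser`, `truecase`) are hypothetical and are not taken from the paper or from any SMT toolkit.

```python
import re
from collections import Counter, defaultdict

# Hypothetical mapping of typographic punctuation to ASCII equivalents.
PUNCT_MAP = {"„": '"', "”": '"', "“": '"', "’": "'", "–": "-", "…": "..."}


def normalize_punctuation(text):
    """Map typographic punctuation to ASCII and tidy the spacing around it."""
    for src, dst in PUNCT_MAP.items():
        text = text.replace(src, dst)
    # Remove whitespace before closing punctuation marks.
    text = re.sub(r"\s+([,.;:!?])", r"\1", text)
    # Collapse any remaining whitespace runs into single spaces.
    return re.sub(r"\s+", " ", text).strip()


def train_truecaser(sentences):
    """Learn the most frequent surface form of every word.

    The first token of each sentence is skipped because its capitalization
    is positional and says little about the word's usual casing.
    """
    counts = defaultdict(Counter)
    for sentence in sentences:
        for token in sentence.split()[1:]:
            counts[token.lower()][token] += 1
    return {lower: forms.most_common(1)[0][0] for lower, forms in counts.items()}


def truecase(sentence, model):
    """Replace each known token with its learned casing; keep unknown tokens."""
    return " ".join(model.get(tok.lower(), tok) for tok in sentence.split())


if __name__ == "__main__":
    model = train_truecaser([
        "The patient took Aspirin daily",
        "Give the patient Aspirin",
    ])
    print(truecase("THE PATIENT took aspirin", model))
    print(normalize_punctuation("Lek „Aspirin” stosuje się , gdy boli głowa ."))
```

A frequency-based truecaser of this kind is a simplification: it ignores context, so a word that is legitimately both a common noun and a proper name always receives its majority casing.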


