Enabling Medical Translation for Low-Resource Languages

10/09/2016
by Ahmad Musleh, et al.

We present research towards bridging the language gap between migrant workers in Qatar and medical staff. In particular, we describe the first steps towards the development of a real-world Hindi-English machine translation system for doctor-patient communication. As this is a low-resource language pair, especially for speech and for the medical domain, our initial focus has been on gathering suitable training data from various sources. We applied a variety of methods ranging from fully automatic extraction from the Web to manual annotation of test data. Moreover, we developed a method for automatically augmenting the training data with synthetically generated variants, which yielded an absolute improvement of more than 3 BLEU points.

research
11/10/2020

Neural Machine Translation for Extremely Low-Resource African Languages: A Case Study on Bambara

Low-resource languages present unique challenges to (neural) machine tra...
research
10/24/2022

Development of Hybrid ASR Systems for Low Resource Medical Domain Conversational Telephone Speech

Language barriers present a great challenge in our increasingly connecte...
research
09/01/2021

Survey of Low-Resource Machine Translation

We present a survey covering the state of the art in low-resource machin...
research
07/28/2018

Domain Robust Feature Extraction for Rapid Low Resource ASR Development

Developing a practical speech recognizer for a low resource language is ...
research
12/05/2022

Impact of Domain-Adapted Multilingual Neural Machine Translation in the Medical Domain

Multilingual Neural Machine Translation (MNMT) models leverage many lang...
research
08/30/2023

Cyberbullying Detection for Low-resource Languages and Dialects: Review of the State of the Art

The struggle of social media platforms to moderate content in a timely m...
research
07/30/2023

A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction

This paper focuses on term-status pair extraction from medical dialogues...
