Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation

10/21/2014
by Michael Bloodgood, et al.

We explore how to improve machine translation systems by adding more translation data in situations where we already have substantial resources. The main challenge is bucking the trend of diminishing returns that is commonly encountered in this setting. We present an active learning-style data solicitation algorithm to meet this challenge. We test it by gathering annotations via Amazon Mechanical Turk and find an order-of-magnitude increase in the rate of performance improvement.

research
07/30/2018

Active Learning for Interactive Neural Machine Translation of Data Streams

We study the application of active learning techniques to the translatio...
research
10/27/2022

COMET-QE and Active Learning for Low-Resource Machine Translation

Active learning aims to deliver maximum benefit when resources are scarc...
research
12/30/2022

Active Learning for Neural Machine Translation

The machine translation mechanism translates texts automatically between...
research
03/09/2022

Onception: Active Learning with Expert Advice for Real World Machine Translation

Active learning can play an important role in low-resource settings (i.e...
research
06/21/2021

Phrase-level Active Learning for Neural Machine Translation

Neural machine translation (NMT) is sensitive to domain shift. In this p...
research
08/20/2019

A Lost Croatian Cybernetic Machine Translation Program

We are exploring the historical significance of research in the field of...
research
12/22/2014

Bayesian Optimisation for Machine Translation

This paper presents novel Bayesian optimisation algorithms for minimum e...
