Translator2Vec: Understanding and Representing Human Post-Editors

07/24/2019
by   António Góis, et al.
0

The combination of machines and humans for translation is effective, with many studies showing productivity gains when humans post-edit machine-translated output instead of translating from scratch. To take full advantage of this combination, we need a fine-grained understanding of how human translators work, and which post-editing styles are more effective than others. In this paper, we release and analyze a new dataset with document-level post-editing action sequences, including edit operations from keystrokes, mouse actions, and waiting times. Our dataset comprises 66,268 full document sessions post-edited by 332 humans, the largest of the kind released to date. We show that action sequences are informative enough to identify post-editors accurately, compared to baselines that only look at the initial and final text. We build on this to learn and visualize continuous representations of post-editors, and we show that these representations improve the downstream task of predicting post-editing time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/25/2021

IntelliCAT: Intelligent Machine Translation Post-Editing with Quality Estimation and Translation Suggestion

We present IntelliCAT, an interactive translation interface with neural ...
research
12/18/2021

Assessing Post-editing Effort in the English-Hindi Direction

We present findings from a first in-depth post-editing effort estimation...
research
06/05/2019

Visual Story Post-Editing

We introduce the first dataset for human edits of machine-generated visu...
research
07/31/2018

Manual Post-editing of Automatically Transcribed Speeches from the Icelandic Parliament - Althingi

The design objectives for an automatic transcription system are to produ...
research
09/20/2021

Latexify Math: Mathematical Formula Markup Revision to Assist Collaborative Editing in Math Q A Sites

Collaborative editing questions and answers plays an important role in q...
research
02/20/2017

Post-edit Analysis of Collective Biography Generation

Text generation is increasingly common but often requires manual post-ed...
research
09/21/2022

PePe: Personalized Post-editing Model utilizing User-generated Post-edits

Incorporating personal preference is crucial in advanced machine transla...

Please sign up or login with your details

Forgot password? Click here to reset