UDAAN - Machine Learning based Post-Editing tool for Document Translation

03/03/2022
by   Ayush Maheshwari, et al.
0

We introduce UDAAN, an open-source post-editing tool that can reduce manual editing efforts to quickly produce publishable-standard documents in different languages. UDAAN has an end-to-end Machine Translation (MT) plus post-editing pipeline wherein users can upload a document to obtain raw MT output. Further, users can edit the raw translations using our tool. UDAAN offers several advantages: a) Domain-aware, vocabulary-based lexical constrained MT. b) source-target and target-target lexicon suggestions for users. Replacements are based on the source and target texts lexicon alignment. c) Suggestions for translations are based on logs created during user interaction. d) Source-target sentence alignment visualisation that reduces the cognitive load of users during editing. e) Translated outputs from our tool are available in multiple formats: docs, latex, and PDF. Although we limit our experiments to English-to-Hindi translation for the current study, our tool is independent of the source and target languages. Experimental results based on the usage of the tools and users feedback show that our tool speeds up the translation time approximately by a factor of three compared to the baseline method of translating documents from scratch.

READ FULL TEXT

page 2

page 5

research
08/16/2019

Improving CAT Tools in the Translation Workflow: New Approaches and Evaluation

This paper describes strategies to improve an existing web-based compute...
research
04/25/2021

Automatic Post-Editing for Translating Chinese Novels to Vietnamese

Automatic post-editing (APE) is an important remedy for reducing errors ...
research
10/24/2022

Bilingual Synchronization: Restoring Translational Relationships with Editing Operations

Machine Translation (MT) is usually viewed as a one-shot process that ge...
research
04/12/2022

Creativity in translation: machine translation as a constraint for literary texts

This article presents the results of a study involving the translation o...
research
08/15/2019

Transformer-based Automatic Post-Editing with a Context-Aware Encoding Approach for Multi-Source Inputs

Recent approaches to the Automatic Post-Editing (APE) research have show...
research
07/01/2019

Post-editese: an Exacerbated Translationese

Post-editing (PE) machine translation (MT) is widely used for disseminat...
research
06/30/2021

Learning a Reversible Embedding Mapping using Bi-Directional Manifold Alignment

We propose a Bi-Directional Manifold Alignment (BDMA) that learns a non-...

Please sign up or login with your details

Forgot password? Click here to reset