Towards Recognizing Phrase Translation Processes: Experiments on English-French

by Yuming Zhai, et al.

When translating phrases (words or groups of words), human translators, consciously or not, resort to translation processes other than literal translation, such as Idiom Equivalence, Generalization, Particularization, and Semantic Modulation. Translators and linguists (such as Vinay and Darbelnet, and Newmark) have proposed several typologies to characterize these translation processes. However, to the best of our knowledge, there has been no effort to classify these fine-grained translation processes automatically. Recently, an English-French parallel corpus of TED Talks has been manually annotated with translation process categories, along with established annotation guidelines. Based on these annotated examples, we propose an automatic classification of translation processes at the subsentential level. Experimental results show that we can distinguish non-literal translation from literal translation with an accuracy of 87.09%, and 55.20% [...]. This work demonstrates that it is possible to automatically classify translation processes. Even with a small number of annotated examples, our experiments indicate directions that we can follow in future work. One of our long-term objectives is to leverage this automatic classification to better control paraphrase extraction from bilingual parallel corpora.



