Using CollGram to Compare Formulaic Language in Human and Neural Machine Translation

07/08/2021
by Yves Bestgen, et al.

A comparison of formulaic sequences in human and neural machine translation of quality newspaper articles shows that neural machine translations contain fewer lower-frequency but strongly associated formulaic sequences, and more high-frequency formulaic sequences. These differences were statistically significant, and the effect sizes were almost always medium or large. These observations can be related to the differences observed between second language learners of various levels and between translated and untranslated texts. The comparison among the neural machine translation systems indicates that some systems produce more formulaic sequences of both types than others.
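The contrast between lower-frequency, strongly associated sequences and high-frequency sequences can be illustrated with two bigram association scores. The abstract does not name the measures used, so the sketch below assumes mutual information (MI), which tends to favor rare but tightly bound word pairs, and the t-score, which tends to favor frequent pairs. The function name `bigram_association` and the toy token list are hypothetical, and a real analysis would take its counts from a large reference corpus rather than from the text being scored.

```python
import math
from collections import Counter


def bigram_association(tokens):
    """Return {bigram: (mi, t_score)} computed from a list of tokens.

    Assumption: the token list stands in for a reference corpus.
    """
    n = len(tokens)
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    scores = {}
    for (w1, w2), observed in bigrams.items():
        # Co-occurrence count expected if the two words were independent.
        expected = unigrams[w1] * unigrams[w2] / n
        mi = math.log2(observed / expected)
        t_score = (observed - expected) / math.sqrt(observed)
        scores[(w1, w2)] = (mi, t_score)
    return scores


# Toy data (hypothetical): one frequent pair and one rare, tightly bound pair.
tokens = ["of", "the", "of", "the", "of", "the", "red", "herring"]
scores = bigram_association(tokens)
mi_frequent, t_frequent = scores[("of", "the")]
mi_rare, t_rare = scores[("red", "herring")]
```

On this toy input the rare pair receives the higher MI while the frequent pair receives the higher t-score, which is the kind of split between the two types of formulaic sequence that the abstract describes.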
