How do lexical semantics affect translation? An empirical study

12/31/2021
by   Vivek Subramanian, et al.
0

Neural machine translation (NMT) systems aim to map text from one language into another. While there are a wide variety of applications of NMT, one of the most important is translation of natural language. A distinguishing factor of natural language is that words are typically ordered according to the rules of the grammar of a given language. Although many advances have been made in developing NMT systems for translating natural language, little research has been done on understanding how the word ordering of and lexical similarity between the source and target language affect translation performance. Here, we investigate these relationships on a variety of low-resource language pairs from the OpenSubtitles2016 database, where the source language is English, and find that the more similar the target language is to English, the greater the translation performance. In addition, we study the impact of providing NMT models with part of speech of words (POS) in the English sequence and find that, for Transformer-based models, the more dissimilar the target language is from English, the greater the benefit provided by POS.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/17/2022

Improving English to Sinhala Neural Machine Translation using Part-of-Speech Tag

The performance of Neural Machine Translation (NMT) depends significantl...
research
09/27/2021

Towards Reinforcement Learning for Pivot-based Neural Machine Translation with Non-autoregressive Transformer

Pivot-based neural machine translation (NMT) is commonly used in low-res...
research
02/28/2022

The impact of lexical and grammatical processing on generating code from natural language

Considering the seq2seq architecture of TranX for natural language to co...
research
02/03/2023

Lexical Simplification using multi level and modular approach

Text Simplification is an ongoing problem in Natural Language Processing...
research
10/29/2019

Findings of the Third Workshop on Neural Generation and Translation

This document describes the findings of the Third Workshop on Neural Gen...
research
05/17/2020

Encodings of Source Syntax: Similarities in NMT Representations Across Target Languages

We train neural machine translation (NMT) models from English to six tar...
research
08/16/2023

Fast Training of NMT Model with Data Sorting

The Transformer model has revolutionized Natural Language Processing tas...

Please sign up or login with your details

Forgot password? Click here to reset