Incorporate Semantic Structures into Machine Translation Evaluation via UCCA

10/17/2020
by   Jin Xu, et al.

The copying mechanism is commonly used in neural paraphrasing networks and other text generation tasks, where it preserves important words from the input sequence in the output sequence. Similarly, in machine translation, we observe that certain words or phrases appear in all good translations of a given source text, and that these words tend to convey important semantic information. In this work, we therefore define the words that carry important semantic meaning in a sentence as semantic core words. We further propose an MT evaluation approach named Semantically Weighted Sentence Similarity (SWSS), which uses UCCA to identify semantic core words and then computes sentence similarity scores from the overlap of these core words. Experimental results show that SWSS consistently improves the performance of popular MT evaluation metrics that are based on lexical similarity.
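
To make the idea of scoring by core-word overlap concrete, here is a minimal sketch. It is not the paper's SWSS formula, which the abstract does not spell out. It assumes the semantic core words of each sentence have already been extracted (in the paper this is done with UCCA; here they are supplied by hand), and it uses a plain F1 over the two word multisets as a stand-in similarity. The function name `core_word_overlap_f1` and the example word lists are hypothetical.

```python
from collections import Counter


def core_word_overlap_f1(candidate_core, reference_core):
    """F1 overlap between two lists of core words.

    Illustrative assumption only: the abstract does not give the actual
    SWSS scoring formula, and core-word extraction via UCCA is not shown.
    """
    # Compare case-insensitively and count repeated words as a multiset.
    cand = Counter(w.lower() for w in candidate_core)
    ref = Counter(w.lower() for w in reference_core)
    if not cand or not ref:
        return 0.0

    # Multiset intersection keeps the minimum count of each shared word.
    matched = sum((cand & ref).values())
    if matched == 0:
        return 0.0

    precision = matched / sum(cand.values())
    recall = matched / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


# Hand-picked core words for a reference and a candidate translation
# (hypothetical data, not taken from the paper).
reference_core = ["committee", "approved", "budget"]
candidate_core = ["committee", "passed", "budget"]
score = core_word_overlap_f1(candidate_core, reference_core)
```

In this example two of the three core words match in each direction, so precision and recall are both 2/3. A score of this kind could then be combined with an existing lexical metric, which is the role the abstract describes for SWSS; how that combination is weighted is left open here.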


research
10/05/2016

Word2Vec vs DBnary: Augmenting METEOR using Vector Representations or Lexical Resources?

This paper presents an approach combining lexico-semantic resources and ...
research
05/11/2023

Subword Segmental Machine Translation: Unifying Segmentation and Target Sentence Generation

Subword segmenters like BPE operate as a preprocessing step in neural ma...
research
06/30/2016

HUME: Human UCCA-Based Evaluation of Machine Translation

Human evaluation of machine translation normally uses sentence-level mea...
research
12/06/2019

Decomposing predictability: Semantic feature overlap between words and the dynamics of reading for meaning

The present study uses a computational approach to examine the role of s...
research
02/10/2017

Local System Voting Feature for Machine Translation System Combination

In this paper, we enhance the traditional confusion network system combi...
research
03/23/2016

Evaluating semantic models with word-sentence relatedness

Semantic textual similarity (STS) systems are designed to encode and eva...
research
05/24/2022

Principled Paraphrase Generation with Parallel Corpora

Round-trip Machine Translation (MT) is a popular choice for paraphrase g...
