Sentiment-based Candidate Selection for NMT

04/10/2021
by   Alex Jones, et al.
0

The explosion of user-generated content (UGC)–e.g. social media posts, comments, and reviews–has motivated the development of NLP applications tailored to these types of informal texts. Prevalent among these applications have been sentiment analysis and machine translation (MT). Grounded in the observation that UGC features highly idiomatic, sentiment-charged language, we propose a decoder-side approach that incorporates automatic sentiment scoring into the MT candidate selection process. We train separate English and Spanish sentiment classifiers, then, using n-best candidates generated by a baseline MT model with beam search, select the candidate that minimizes the absolute difference between the sentiment score of the source sentence and that of the translation, and perform a human evaluation to assess the produced translations. Unlike previous work, we select this minimally divergent translation by considering the sentiment scores of the source sentence and translation on a continuous interval, rather than using e.g. binary classification, allowing for more fine-grained selection of translation candidates. The results of human evaluations show that, in comparison to the open-source MT baseline model on top of which our sentiment-based pipeline is built, our pipeline produces more accurate translations of colloquial, sentiment-heavy source texts.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/30/2021

Sentiment-Aware Measure (SAM) for Evaluating Sentiment Transfer by Machine Translation Systems

In translating text where sentiment is the main message, human translato...
research
09/29/2021

BLEU, METEOR, BERTScore: Evaluation of Metrics Performance in Assessing Critical Translation Errors in Sentiment-oriented Text

Social media companies as well as authorities make extensive use of arti...
research
10/21/2022

A Semi-supervised Approach for a Better Translation of Sentiment in Dialectical Arabic UGT

In the online world, Machine Translation (MT) systems are extensively us...
research
11/23/2016

ATR4S: Toolkit with State-of-the-art Automatic Terms Recognition Methods in Scala

Automatically recognized terminology is widely used for various domain-s...
research
08/25/2020

The Impact of Indirect Machine Translation on Sentiment Classification

Sentiment classification has been crucial for many natural language proc...
research
09/02/2018

MTNT: A Testbed for Machine Translation of Noisy Text

Noisy or non-standard input text can cause disastrous mistranslations in...

Please sign up or login with your details

Forgot password? Click here to reset