Machine Translation for Machines: the Sentiment Classification Use Case

10/01/2019
by   Amirhossein Tebbifakhr, et al.
0

We propose a neural machine translation (NMT) approach that, instead of pursuing adequacy and fluency ("human-oriented" quality criteria), aims to generate translations that are best suited as input to a natural language processing component designed for a specific downstream task (a "machine-oriented" criterion). Towards this objective, we present a reinforcement learning technique based on a new candidate sampling strategy, which exploits the results obtained on the downstream task as weak feedback. Experiments in sentiment classification of Twitter data in German and Italian show that feeding an English classifier with machine-oriented translations significantly improves its performance. Classification results outperform those obtained with translations produced by general-purpose NMT models as well as by an approach based on reinforcement learning. Moreover, our results on both languages approximate the classification accuracy computed on gold standard English tweets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/17/2016

Pre-Translation for Neural Machine Translation

Recently, the development of neural machine translation (NMT) has signif...
research
03/28/2021

PENELOPIE: Enabling Open Information Extraction for the Greek Language through Machine Translation

In this paper we present our submission for the EACL 2021 SRW; a methodo...
research
07/24/2017

Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback

Machine translation is a natural candidate problem for reinforcement lea...
research
11/01/2016

Dual Learning for Machine Translation

While neural machine translation (NMT) is making good progress in the pa...
research
04/23/2020

Correct Me If You Can: Learning from Error Corrections and Markings

Sequence-to-sequence learning involves a trade-off between signal streng...
research
05/26/2021

Joint Optimization of Tokenization and Downstream Model

Since traditional tokenizers are isolated from a downstream task and mod...
research
07/23/2021

Modelling Latent Translations for Cross-Lingual Transfer

While achieving state-of-the-art results in multiple tasks and languages...

Please sign up or login with your details

Forgot password? Click here to reset