
Preparation of Sentiment tagged Parallel Corpus and Testing its effect on Machine Translation

by Sainik Kumar Mahata, et al.

In this work, we explore whether machine translation output improves when the training parallel corpus is augmented with sentiment information. The paper describes the preparation of a sentiment-tagged English-Bengali parallel corpus: building the raw parallel corpus, running sentiment analysis on its sentences, and training a character-based neural machine translation model on the result. The output of this model is compared with that of a baseline translation model, both manually and with automated metrics such as BLEU and TER.




Extended Parallel Corpus for Amharic-English Machine Translation

This paper describes the acquisition, preprocessing, segmentation, and a...

Towards Computational Linguistics in Minangkabau Language: Studies on Sentiment Analysis and Machine Translation

Although some linguists (Rusmali et al., 1985; Crouch, 2009) have fairly...

JUCBNMT at WMT2018 News Translation Task: Character Based Neural Machine Translation of Finnish to English

In the current work, we present a description of the system submitted to...

Unpaired Sentiment-to-Sentiment Translation: A Cycled Reinforcement Learning Approach

The goal of sentiment-to-sentiment "translation" is to change the underl...

LiSSS: A toy corpus of Literary Spanish Sentences Sentiment for Emotions Detection

In this work we present a new and small corpus in the area of Computatio...

uniblock: Scoring and Filtering Corpus with Unicode Block Information

The preprocessing pipelines in Natural Language Processing usually invol...

Investigation of Sentiment Controllable Chatbot

Conventional seq2seq chatbot models attempt only to find sentences with ...