Distance Based Source Domain Selection for Sentiment Classification

08/28/2018
by   Lex Razoux Schultz, et al.
0

Automated sentiment classification (SC) on short text fragments has received increasing attention in recent years. Performing SC on unseen domains with few or no labeled samples can significantly affect the classification performance due to different expression of sentiment in source and target domain. In this study, we aim to mitigate this undesired impact by proposing a methodology based on a predictive measure, which allows us to select an optimal source domain from a set of candidates. The proposed measure is a linear combination of well-known distance functions between probability distributions supported on the source and target domains (e.g. Earth Mover's distance and Kullback-Leibler divergence). The performance of the proposed methodology is validated through an SC case study in which our numerical experiments suggest a significant improvement in the cross domain classification error in comparison with a random selected source domain for both a naive and adaptive learning setting. In the case of more heterogeneous datasets, the predictability feature of the proposed model can be utilized to further select a subset of candidate domains, where the corresponding classifier outperforms the one trained on all available source domains. This observation reinforces a hypothesis that our proposed model may also be deployed as a means to filter out redundant information during a training phase of SC.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/04/2021

Domain Adaptation for Sentiment Analysis Using Increased Intraclass Separation

Sentiment analysis is a costly yet necessary task for enterprises to stu...
research
09/03/2018

Adaptive Semi-supervised Learning for Cross-domain Sentiment Classification

We consider the cross-domain sentiment classification problem, where a s...
research
04/09/2020

Recommendation Chart of Domains for Cross-Domain Sentiment Analysis:Findings of A 20 Domain Study

Cross-domain sentiment analysis (CDSA) helps to address the problem of d...
research
11/28/2021

Topic Driven Adaptive Network for Cross-Domain Sentiment Classification

Cross-domain sentiment classification has been a hot spot these years, w...
research
11/17/2020

Curriculum CycleGAN for Textual Sentiment Domain Adaptation with Multiple Sources

Sentiment analysis of user-generated reviews or comments on products and...
research
02/01/2020

Improving Domain-Adapted Sentiment Classification by Deep Adversarial Mutual Learning

Domain-adapted sentiment classification refers to training on a labeled ...
research
09/25/2017

"Let me convince you to buy my product ... ": A Case Study of an Automated Persuasive System for Fashion Products

Persuasivenes is a creative art aimed at making people believe in certai...

Please sign up or login with your details

Forgot password? Click here to reset