Curriculum Learning Strategies for Hindi-English Codemixed Sentiment Analysis

06/18/2019
by   Anirudh Dahiya, et al.
0

Sentiment Analysis and other semantic tasks are commonly used for social media textual analysis to gauge public opinion and make sense from the noise on social media. The language used on social media not only commonly diverges from the formal language, but is compounded by codemixing between languages, especially in large multilingual societies like India. Traditional methods for learning semantic NLP tasks have long relied on end to end task specific training, requiring expensive data creation process, even more so for deep learning methods. This challenge is even more severe for resource scarce texts like codemixed language pairs, with lack of well learnt representations as model priors, and task specific datasets can be few and small in quantities to efficiently exploit recent deep learning approaches. To address above challenges, we introduce curriculum learning strategies for semantic tasks in code-mixed Hindi-English (Hi-En) texts, and investigate various training strategies for enhancing model performance. Our method outperforms the state of the art methods for Hi-En codemixed sentiment analysis by 3.31 convergence, and variance in test performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/15/2020

NUIG-Shubhanker@Dravidian-CodeMix-FIRE2020: Sentiment Analysis of Code-Mixed Dravidian text using XLNet

Social media has penetrated into multilingual societies, however most of...
research
02/15/2018

JU_KS@SAIL_CodeMixed-2017: Sentiment Analysis for Indian Code Mixed Social Media Texts

This paper reports about our work in the NLP Tool Contest @ICON-2017, sh...
research
12/27/2019

Language Independent Sentiment Analysis

Social media platforms and online forums generate rapid and increasing a...
research
11/02/2016

Towards Sub-Word Level Compositions for Sentiment Analysis of Hindi-English Code Mixed Text

Sentiment analysis (SA) using code-mixed data from social media has seve...
research
11/15/2021

IIITT@Dravidian-CodeMix-FIRE2021: Transliterate or translate? Sentiment analysis of code-mixed text in Dravidian languages

Sentiment analysis of social media posts and comments for various market...
research
10/09/2017

Deep Learning Paradigm with Transformed Monolingual Word Embeddings for Multilingual Sentiment Analysis

The surge of social media use brings huge demand of multilingual sentime...
research
04/11/2020

Classification Benchmarks for Under-resourced Bengali Language based on Multichannel Convolutional-LSTM Network

Exponential growths of social media and micro-blogging sites not only pr...

Please sign up or login with your details

Forgot password? Click here to reset