Combining Lexical Features and a Supervised Learning Approach for Arabic Sentiment Analysis

10/23/2017
by   Samhaa R. El-Beltagy, et al.
0

The importance of building sentiment analysis tools for Arabic social media has been recognized during the past couple of years, especially with the rapid increase in the number of Arabic social media users. One of the main difficulties in tackling this problem is that text within social media is mostly colloquial, with many dialects being used within social media platforms. In this paper, we present a set of features that were integrated with a machine learning based sentiment analysis model and applied on Egyptian, Saudi, Levantine, and MSA Arabic social media datasets. Many of the proposed features were derived through the use of an Arabic Sentiment Lexicon. The model also presents emoticon based features, as well as input text related features such as the number of segments within the text, the length of the text, whether the text ends with a question mark or not, etc. We show that the presented features have resulted in an increased accuracy across six of the seven datasets we've experimented with and which are all benchmarked. Since the developed model out-performs all existing Arabic sentiment analysis systems that have publicly available datasets, we can state that this model presents state-of-the-art in Arabic sentiment analysis.

READ FULL TEXT
research
11/15/2015

A System for Extracting Sentiment from Large-Scale Arabic Social Data

Social media data in Arabic language is becoming more and more abundant....
research
09/08/2018

Sentiment analysis for Arabic language: A brief survey of approaches and techniques

With the emergence of Web 2.0 technology and the expansion of on-line so...
research
12/30/2019

AraNet: A Deep Learning Toolkit for Arabic Social Media

We describe AraNet, a collection of deep learning Arabic social media pr...
research
04/23/2019

Empirical Evaluation of Leveraging Named Entities for Arabic Sentiment Analysis

Social media reflects the public attitudes towards specific events. Even...
research
08/15/2018

SentiALG: Automated Corpus Annotation for Algerian Sentiment Analysis

Data annotation is an important but time-consuming and costly procedure....
research
08/17/2016

SlangSD: Building and Using a Sentiment Dictionary of Slang Words for Short-Text Sentiment Classification

Sentiment in social media is increasingly considered as an important res...

Please sign up or login with your details

Forgot password? Click here to reset