Investigating Classification Techniques with Feature Selection For Intention Mining From Twitter Feed

01/22/2020
by Qadri Mishael, et al.

In the last decade, social networks have become the most popular medium for communication and interaction. For example, the micro-blogging service Twitter has more than 200 million registered users who exchange more than 65 million posts per day. Users express their thoughts, ideas, and even their intentions through these tweets. Most tweets are written informally, often in slang that contains misspelt and abbreviated words. This paper investigates the problem of selecting the features that affect the extraction of users' intentions from Twitter feeds using text mining techniques. It starts by presenting the method we used to construct our own dataset from extracted Twitter feeds. We then present two feature selection techniques, each followed by classification. In the first technique, we use Information Gain as a one-phase feature selection step, followed by supervised classification algorithms. In the second technique, we use a hybrid approach based on a forward feature selection algorithm, in which two feature selection techniques are employed, followed by classification algorithms. We examine both techniques with four classification algorithms, evaluate them on our own dataset, and critically review the results.
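To make the first technique concrete, the following is a minimal sketch of Information Gain used as a one-phase term filter. It is an illustration under stated assumptions, not the authors' implementation: the tweets, the labels `intent` / `no_intent`, the binary "term present or absent" feature encoding, and the helper names `entropy`, `information_gain`, and `top_k_terms` are all hypothetical. The abstract does not name the four classifiers, so the sketch stops at feature selection.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(docs, labels, term):
    """Information Gain of a binary 'term present / absent' feature.

    docs is a list of token sets, one per tweet (an assumed encoding).
    """
    with_term = [y for d, y in zip(docs, labels) if term in d]
    without_term = [y for d, y in zip(docs, labels) if term not in d]
    n = len(labels)
    # Expected entropy of the labels after splitting on the term.
    conditional = (
        (len(with_term) / n) * entropy(with_term)
        + (len(without_term) / n) * entropy(without_term)
    )
    return entropy(labels) - conditional


def top_k_terms(docs, labels, k):
    """Rank every vocabulary term by Information Gain and keep the best k."""
    vocab = sorted(set().union(*docs))
    scored = [(information_gain(docs, labels, t), t) for t in vocab]
    # Highest gain first; ties are broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [term for _, term in scored[:k]]


# Made-up toy tweets and labels, for illustration only.
tweets = [
    "i want to buy a new phone",
    "gonna buy a laptop soon",
    "the weather is nice today",
    "watching a movie today",
]
labels = ["intent", "intent", "no_intent", "no_intent"]
docs = [set(t.split()) for t in tweets]

selected = top_k_terms(docs, labels, 2)
print(selected)
```

On this toy data the terms that split the two classes perfectly receive the highest score, so they are the ones kept. In a real pipeline the selected terms would define the feature vectors passed to a supervised classifier.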


research
07/11/2020

Feature Selection on Noisy Twitter Short Text Messages for Language Identification

The task of written language identification involves typically the detec...
research
11/30/2020

Twitter Spam Detection: A Systematic Review

Nowadays, with the rise of Internet access and mobile devices around the...
research
07/01/2020

Understanding phishers' strategies of mimicking uniform resource locators to leverage phishing attacks: A machine learning approach

Phishing is a type of social engineering attack with an intention to ste...
research
08/27/2017

Impact of Feature Selection on Micro-Text Classification

Social media datasets, especially Twitter tweets, are popular in the fie...
research
03/06/2017

Performing Stance Detection on Twitter Data using Computational Linguistics Techniques

As humans, we can often detect from a persons utterances if he or she is...
research
04/20/2018

twAwler: A lightweight twitter crawler

This paper presents twAwler, a lightweight twitter crawler that targets ...
research
10/24/2018

A Text Classification Application: Poet Detection from Poetry

With the widespread use of the internet, the size of the text data incre...
