Subject Specific Stream Classification Preprocessing Algorithm for Twitter Data Stream

05/28/2017
by Nisansa de Silva, et al.

The micro-blogging service Twitter is a lucrative source for data mining applications on global sentiment. However, because of the wide variety of subjects mentioned in the data items, it is inefficient to run a data mining algorithm on the raw data. This paper discusses an algorithm that accurately classifies the entire stream into a given number of mutually exclusive, collectively exhaustive streams, on each of which the data mining algorithm can be run separately, yielding more relevant results with high efficiency.
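To make the idea of mutually exclusive, collectively exhaustive streams concrete, the following is a minimal sketch and not the algorithm from the paper, whose details are not given in this abstract. It assumes a simple keyword-overlap rule; the subject names, keyword sets, and the `classify` and `split_stream` helpers are all hypothetical. Each tweet receives exactly one label (mutual exclusivity), and a catch-all `other` stream absorbs anything that matches no subject (collective exhaustiveness).

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Set

# Hypothetical subject vocabularies for illustration; not taken from the paper.
SUBJECT_KEYWORDS: Dict[str, Set[str]] = {
    "sports": {"match", "goal", "team"},
    "politics": {"election", "vote", "senate"},
}
OTHER = "other"  # catch-all stream that keeps the partition exhaustive


def classify(tweet: str) -> str:
    """Return exactly one subject label for a tweet (assumed keyword rule)."""
    words = set(tweet.lower().split())
    best_label, best_hits = OTHER, 0
    # Strict '>' means ties go to the subject listed first in the dict.
    for label, keywords in SUBJECT_KEYWORDS.items():
        hits = len(words & keywords)
        if hits > best_hits:
            best_label, best_hits = label, hits
    return best_label


def split_stream(tweets: Iterable[str]) -> Dict[str, List[str]]:
    """Partition tweets into per-subject streams; each tweet lands in one stream."""
    streams: Dict[str, List[str]] = defaultdict(list)
    for tweet in tweets:
        streams[classify(tweet)].append(tweet)
    return dict(streams)
```

A downstream data mining step could then be run on each value of the dictionary returned by `split_stream` separately, instead of on the mixed raw stream.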

research
06/20/2019

Preprocessing Methods and Pipelines of Data Mining: An Overview

Data mining is about obtaining new knowledge from existing datasets. How...
research
12/01/2020

Applying data mining and machine learning techniques for sentiment shifter identification

Sentiment shifters, as a set of words and expressions that can affect te...
research
09/19/2017

CASP-DM: Context Aware Standard Process for Data Mining

We propose an extension of the Cross Industry Standard Process for Data ...
research
11/11/2012

Mining Determinism in Human Strategic Behavior

This work lies in the fusion of experimental economics and data mining. ...
research
08/21/2019

Visualization in the preprocessing phase: an interview study with enterprise professionals

The current information age has increasingly required organizations to b...
research
10/30/2022

A Pipeline for Analysing Grant Applications

Data mining techniques can transform massive amounts of unstructured dat...
research
01/15/2019

Data-driven Modelling of Smart Building Ventilation Subsystem

Considering the advances in building monitoring and control through netw...
