Curating Social Media Data

02/21/2020
by Kushal Vaghani, et al.

Social media platforms have democratized access to the pulse of the people in the modern era. Because of their immense popularity and heavy use, the data published on social media sites (e.g., Twitter, Facebook and Tumblr) is a rich ocean of information. Data-driven analytics of these social imprints has therefore become a vital asset for organisations and governments seeking to improve their products and services. However, because social media data is dynamic and noisy, performing accurate analysis on the raw data is a challenging task. A key requirement is to curate the raw data before it is fed into analytics pipelines. This curation process transforms the raw data into contextualized data and knowledge. We propose a data curation pipeline, namely CrowdCorrect, that enables analysts to cleanse and curate social data and prepare it for reliable analytics. Our pipeline provides automatic feature extraction from a corpus of social media data using existing in-house tools. Further, we offer a dual-correction mechanism that combines automated and crowd-sourced approaches. The implementation of this pipeline also includes a set of tools for automatically creating micro-tasks, which make it easier for crowd users to contribute to curating the raw data. For the purposes of this research, we use Twitter as our motivating social media platform due to its popularity.
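The three stages named in the abstract (feature extraction, automated correction, and micro-task creation for the crowd) can be illustrated with a minimal Python sketch. This is not the CrowdCorrect implementation: the function names, the `KNOWN_CORRECTIONS` lookup table, the vocabulary set, and the micro-task format are all assumptions made for illustration, and a simple dictionary lookup stands in for whatever automated correction services the actual pipeline uses.

```python
import re

# Hypothetical lookup table standing in for an automated correction service.
KNOWN_CORRECTIONS = {"govt": "government", "hlth": "health"}

URL_PATTERN = r"https?://\S+"


def extract_features(text):
    """Pull URLs, hashtags, mentions and plain keywords out of a tweet."""
    urls = re.findall(URL_PATTERN, text)
    without_urls = re.sub(URL_PATTERN, " ", text)
    hashtags = re.findall(r"#(\w+)", without_urls)
    mentions = re.findall(r"@(\w+)", without_urls)
    # Drop hashtags and mentions so only ordinary words remain.
    plain = re.sub(r"[#@]\w+", " ", without_urls)
    keywords = [w.lower() for w in re.findall(r"[A-Za-z']+", plain)]
    return {
        "urls": urls,
        "hashtags": hashtags,
        "mentions": mentions,
        "keywords": keywords,
    }


def auto_correct(keywords, vocabulary):
    """Automated pass: fix known errors, collect words it cannot resolve."""
    corrected, unresolved = [], []
    for word in keywords:
        if word in KNOWN_CORRECTIONS:
            corrected.append(KNOWN_CORRECTIONS[word])
        else:
            corrected.append(word)
            if word not in vocabulary:
                unresolved.append(word)
    return corrected, unresolved


def make_micro_tasks(tweet_text, unresolved):
    """Crowd pass: turn each unresolved word into a small question."""
    return [
        {
            "token": word,
            "question": f'In the tweet "{tweet_text}", what is the intended word for "{word}"?',
            "answers": [],  # to be filled in by crowd workers
        }
        for word in unresolved
    ]


if __name__ == "__main__":
    tweet = "Govt cuts hlth fundng #budget2020 @treasury https://example.com/a"
    vocabulary = {"government", "cuts", "health", "funding"}
    features = extract_features(tweet)
    corrected, unresolved = auto_correct(features["keywords"], vocabulary)
    for task in make_micro_tasks(tweet, unresolved):
        print(task["question"])
```

In this sketch the automated pass handles the abbreviations it recognises, and only the words it cannot resolve are handed to the crowd, which is one plausible reading of the dual-correction idea described above.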
