Crowd-Powered Data Mining

06/13/2018
by Chengliang Chai, et al.

Many data mining tasks, such as sentiment analysis and image classification, cannot be completely addressed by automated processes. Crowdsourcing is an effective way to harness human cognitive ability to process these machine-hard tasks. Thanks to public crowdsourcing platforms, e.g., Amazon Mechanical Turk and CrowdFlower, we can easily involve hundreds of thousands of ordinary workers (i.e., the crowd) to address these tasks. In this tutorial, we survey and synthesize a wide spectrum of existing studies on crowd-powered data mining. We first give an overview of crowdsourcing, and then summarize the fundamental techniques, including quality control, cost control, and latency control, which must be considered in crowdsourced data mining. Next, we review crowd-powered data mining operations, including classification, clustering, pattern mining, outlier detection, and knowledge base construction and enrichment. Finally, we discuss emerging challenges in crowdsourced data mining.

