T-Crowd: Effective Crowdsourcing for Tabular Data

08/07/2017
by   Caihua Shan, et al.
0

Crowdsourcing employs human workers to solve computer-hard problems, such as data cleaning, entity resolution, and sentiment analysis. When crowdsourcing tabular data, e.g., the attribute values of an entity set, a worker's answers on the different attributes (e.g., the nationality and age of a celebrity star) are often treated independently. This assumption is not always true and can lead to suboptimal crowdsourcing performance. In this paper, we present the T-Crowd system, which takes into consideration the intricate relationships among tasks, in order to converge faster to their true values. Particularly, T-Crowd integrates each worker's answers on different attributes to effectively learn his/her trustworthiness and the true data values. The attribute relationship information is also used to guide task allocation to workers. Finally, T-Crowd seamlessly supports categorical and continuous attributes, which are the two main datatypes found in typical databases. Our extensive experiments on real and synthetic datasets show that T-Crowd outperforms state-of-the-art methods in terms of truth inference and reducing the cost of crowdsourcing.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/07/2019

Accurate inference of crowdsourcing properties when using efficient allocation strategies

Allocation strategies improve the efficiency of crowdsourcing by decreas...
research
08/24/2018

Truth Inference on Sparse Crowdsourcing Data with Local Differential Privacy

Crowdsourcing has arisen as a new problem-solving paradigm for tasks tha...
research
02/03/2017

A Theoretical Analysis of First Heuristics of Crowdsourced Entity Resolution

Entity resolution (ER) is the task of identifying all records in a datab...
research
02/08/2023

Multiview Representation Learning from Crowdsourced Triplet Comparisons

Crowdsourcing has been used to collect data at scale in numerous fields....
research
02/14/2016

Embracing Error to Enable Rapid Crowdsourcing

Microtask crowdsourcing has enabled dataset advances in social science a...
research
11/07/2021

Open-Set Crowdsourcing using Multiple-Source Transfer Learning

We raise and define a new crowdsourcing scenario, open set crowdsourcing...
research
09/11/2018

Reducing Uncertainty of Schema Matching via Crowdsourcing with Accuracy Rates

Schema matching is a central challenge for data integration systems. Ins...

Please sign up or login with your details

Forgot password? Click here to reset