Sequential Multi-Class Labeling in Crowdsourcing

11/06/2017
by Qiyu Kang, et al.

We consider a crowdsourcing platform in which workers' responses to questions posed by a crowdsourcer are used to determine the hidden state of a multi-class labeling problem. Because workers may be unreliable, we propose sequential questioning, in which each question posed to the workers is designed based on previous questions and answers. We formulate a partially observable Markov decision process (POMDP) framework to determine the best questioning strategy subject to the crowdsourcer's budget constraint. As this POMDP formulation is in general intractable, we develop a suboptimal approach based on a q-ary Ulam-Rényi game. We also propose a sampling heuristic that uses our Ulam-Rényi strategy and can be used in tandem with standard POMDP solvers. We demonstrate through simulations that our approaches outperform a non-sequential strategy that is based on error-correction coding and does not use workers' previous responses.
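To make the idea of sequential questioning concrete, the sketch below shows one Bayesian belief update over the hidden class after a worker answers a q-ary question. It is illustrative only and is not the method from the paper: the function name `update_belief`, the representation of a question as a partition of the classes into q subsets, and the worker noise model (the worker reports the correct subset with probability `p_correct` and otherwise picks one of the other q - 1 subsets uniformly at random) are all assumptions made for this example.

```python
from typing import List, Sequence


def update_belief(
    belief: Sequence[float],
    partition: Sequence[int],
    answer: int,
    p_correct: float,
    q: int,
) -> List[float]:
    """Return the posterior over classes after one noisy q-ary answer.

    belief[i]    : prior probability that the hidden class is i
    partition[i] : index (0..q-1) of the question subset containing class i
    answer       : subset index reported by the worker
    p_correct    : assumed probability that the worker reports the true subset
    q            : number of answer options in the question (q >= 2)
    """
    if q < 2:
        raise ValueError("q must be at least 2")
    # Assumed noise model: a wrong answer is uniform over the other q - 1 subsets.
    p_wrong = (1.0 - p_correct) / (q - 1)

    # Likelihood of the observed answer under each candidate class, times the prior.
    unnormalized = [
        b * (p_correct if partition[i] == answer else p_wrong)
        for i, b in enumerate(belief)
    ]
    total = sum(unnormalized)
    if total == 0.0:
        raise ValueError("answer has zero probability under the current belief")
    return [u / total for u in unnormalized]


# Four classes, uniform prior, binary question "is the class in {0, 1}?".
prior = [0.25, 0.25, 0.25, 0.25]
posterior = update_belief(prior, partition=[0, 0, 1, 1], answer=0, p_correct=0.8, q=2)
```

Under these assumptions the posterior is approximately [0.4, 0.4, 0.1, 0.1]. A sequential strategy would then choose the next partition based on this updated belief, whereas a non-sequential strategy fixes all questions in advance.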

research
03/12/2014

Statistical Decision Making for Optimal Budget Allocation in Crowd Labeling

In crowd labeling, a large amount of unlabeled data instances are outsou...
research
07/21/2017

Autocompletion interfaces make crowd workers slower, but their use promotes response diversity

Creative tasks such as ideation or question proposal are powerful applic...
research
12/21/2016

Bayesian Decision Process for Cost-Efficient Dynamic Ranking via Crowdsourcing

Rank aggregation based on pairwise comparisons over a set of items has a...
research
02/12/2018

Distinguishing Question Subjectivity from Difficulty for Improved Crowdsourcing

The questions in a crowdsourcing task typically exhibit varying degrees ...
research
12/31/2015

Bayes-Optimal Effort Allocation in Crowdsourcing: Bounds and Index Policies

We consider effort allocation in crowdsourcing, where we wish to assign ...
research
05/02/2019

Truth Discovery via Proxy Voting

Truth discovery is a general name for a broad range of statistical metho...
research
09/11/2018

Reducing Uncertainty of Schema Matching via Crowdsourcing with Accuracy Rates

Schema matching is a central challenge for data integration systems. Ins...
