Statistical Decision Making for Optimal Budget Allocation in Crowd Labeling

03/12/2014
by   Xi Chen, et al.
0

In crowd labeling, a large amount of unlabeled data instances are outsourced to a crowd of workers. Workers will be paid for each label they provide, but the labeling requester usually has only a limited amount of the budget. Since data instances have different levels of labeling difficulty and workers have different reliability, it is desirable to have an optimal policy to allocate the budget among all instance-worker pairs such that the overall labeling accuracy is maximized. We consider categorical labeling tasks and formulate the budget allocation problem as a Bayesian Markov decision process (MDP), which simultaneously conducts learning and decision making. Using the dynamic programming (DP) recurrence, one can obtain the optimal allocation policy. However, DP quickly becomes computationally intractable when the size of the problem increases. To solve this challenge, we propose a computationally efficient approximate policy, called optimistic knowledge gradient policy. Our MDP is a quite general framework, which applies to both pull crowdsourcing marketplaces with homogeneous workers and push marketplaces with heterogeneous workers. It can also incorporate the contextual information of instances when they are available. The experiments on both simulated and real data show that the proposed policy achieves a higher labeling accuracy than other existing policies at the same budget level.

READ FULL TEXT
research
12/21/2016

Bayesian Decision Process for Cost-Efficient Dynamic Ranking via Crowdsourcing

Rank aggregation based on pairwise comparisons over a set of items has a...
research
12/31/2015

Bayes-Optimal Effort Allocation in Crowdsourcing: Bounds and Index Policies

We consider effort allocation in crowdsourcing, where we wish to assign ...
research
11/06/2017

Sequential Multi-Class Labeling in Crowdsourcing

We consider a crowdsourcing platform where workers' responses to questio...
research
08/31/2016

Dynamic Allocation of Crowd Contributions for Sentiment Analysis during the 2016 U.S. Presidential Election

Opinions about the 2016 U.S. Presidential Candidates have been expressed...
research
11/07/2021

Crowdsourcing with Meta-Workers: A New Way to Save the Budget

Due to the unreliability of Internet workers, it's difficult to complete...
research
05/22/2018

HyTasker: Hybrid Task Allocation in Mobile Crowd Sensing

Task allocation is a major challenge in Mobile Crowd Sensing (MCS). Whil...
research
10/14/2016

Tuning Crowdsourced Human Computation

As the use of crowdsourcing increases, it is important to think about pe...

Please sign up or login with your details

Forgot password? Click here to reset