Time-Sensitive Bayesian Information Aggregation for Crowdsourcing Systems

10/21/2015
by Matteo Venanzi, et al.

Crowdsourcing systems commonly face the problem of aggregating multiple judgments provided by potentially unreliable workers. In addition, several aspects of the design of efficient crowdsourcing processes, such as defining workers' bonuses, fair prices and time limits for tasks, require knowledge of the likely duration of the task at hand. Bringing these together, in this work we introduce a new time-sensitive Bayesian aggregation method that simultaneously estimates a task's duration and obtains reliable aggregations of crowdsourced judgments. Our method, called BCCTime, builds on the key insight that the time taken by a worker to perform a task is an important indicator of the likely quality of the produced judgment. To capture this, BCCTime uses latent variables to represent the uncertainty about the workers' completion time, the tasks' duration and the workers' accuracy. To relate the quality of a judgment to the time a worker spends on a task, our model assumes that each task is completed within a latent time window within which all workers with a propensity to genuinely attempt the labeling task (i.e., non-spammers) are expected to submit their judgments. In contrast, workers with a lower propensity for valid labeling, such as spammers, bots or lazy labelers, are assumed to perform tasks considerably faster or slower than the time required by normal workers. Specifically, we use efficient message-passing Bayesian inference to learn approximate posterior probabilities of (i) the confusion matrix of each worker, (ii) the propensity for valid labeling of each worker, (iii) the unbiased duration of each task and (iv) the true label of each task. Using two real-world public datasets for entity linking tasks, we show that BCCTime produces up to 11% more accurate classifications and up to 100% more informative estimates of a task's duration compared to state-of-the-art methods.
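The time-window intuition described above can be illustrated with a deliberately simplified sketch. This is not BCCTime: the model in the abstract treats the time window, worker propensity and confusion matrices as latent variables learned by message-passing inference, whereas the sketch below assumes the window for each task is already known and simply down-weights judgments submitted outside it. The function name `aggregate_with_time_window`, the `outside_weight` parameter and the example data are all hypothetical.

```python
from collections import defaultdict


def aggregate_with_time_window(judgments, windows, outside_weight=0.1):
    """Weighted vote that down-weights judgments made outside a task's time window.

    judgments: iterable of (worker, task, label, seconds) tuples.
    windows:   dict mapping task -> (lo, hi) in seconds (assumed known here;
               BCCTime instead infers this window as a latent variable).
    outside_weight: hypothetical weight for too-fast or too-slow judgments.
    """
    scores = defaultdict(lambda: defaultdict(float))
    for worker, task, label, seconds in judgments:
        lo, hi = windows[task]
        # Judgments inside the window count fully; others count only a little.
        weight = 1.0 if lo <= seconds <= hi else outside_weight
        scores[task][label] += weight
    # Pick the label with the highest accumulated weight for each task.
    return {task: max(votes, key=votes.get) for task, votes in scores.items()}


# Made-up example: three very fast judgments for "B", two plausible ones for "A".
example_judgments = [
    ("w1", "t1", "B", 1.0),
    ("w2", "t1", "B", 1.5),
    ("w3", "t1", "B", 0.8),
    ("w4", "t1", "A", 30.0),
    ("w5", "t1", "A", 45.0),
]
example_windows = {"t1": (10.0, 120.0)}
result = aggregate_with_time_window(example_judgments, example_windows)
```

In this made-up example a plain majority vote would pick "B", while the time-aware vote favors the label given by workers whose completion times fall inside the window. A full Bayesian treatment would also account for each worker's accuracy rather than using a single fixed weight.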

research
06/01/2020

Variational Bayesian Inference for Crowdsourcing Predictions

Crowdsourcing has emerged as an effective means for performing a number ...
research
10/25/2020

Exploiting Heterogeneous Graph Neural Networks with Latent Worker/Task Correlation Information for Label Aggregation in Crowdsourcing

Crowdsourcing has attracted much attention for its convenience to collec...
research
02/08/2023

AVeCQ: Anonymous Verifiable Crowdsourcing with Worker Qualities

In crowdsourcing systems, requesters publish tasks, and interested worke...
research
05/17/2019

MiSC: Mixed Strategies Crowdsourcing

Popular crowdsourcing techniques mostly focus on evaluating workers' lab...
research
02/12/2016

A Truthful Mechanism with Biparameter Learning for Online Crowdsourcing

We study a problem of allocating divisible jobs, arriving online, to wor...
research
11/11/2021

Full Characterization of Adaptively Strong Majority Voting in Crowdsourcing

A commonly used technique for quality control in crowdsourcing is to tas...
research
02/28/2017

Iterative Bayesian Learning for Crowdsourced Regression

Crowdsourcing platforms emerged as popular venues for purchasing human i...
