Full Characterization of Adaptively Strong Majority Voting in Crowdsourcing

11/11/2021
by   Margarita Boyarskaya, et al.
0

A commonly used technique for quality control in crowdsourcing is to task the workers with examining an item and voting on whether the item is labeled correctly. To counteract possible noise in worker responses, one solution is to keep soliciting votes from more workers until the difference between the numbers of votes for the two possible outcomes exceeds a pre-specified threshold δ. We show a way to model such δ-margin voting consensus aggregation process using absorbing Markov chains. We provide closed-form equations for the key properties of this voting process – namely, for the quality of the results, the expected number of votes to completion, the variance of the required number of votes, and other moments of the distribution. Using these results, we show further that one can adapt the value of the threshold δ to achieve quality-equivalence across voting processes that employ workers of different accuracy levels. We then use this result to provide efficiency-equalizing payment rates for groups of workers characterized by different levels of response accuracy. Finally, we perform a set of simulated experiments using both fully synthetic data as well as real-life crowdsourced votes. We show that our theoretical model characterizes the outcomes of the consensus aggregation process well.

READ FULL TEXT
research
05/17/2019

Graph Mining Meets Crowdsourcing: Extracting Experts for Answer Aggregation

Aggregating responses from crowd workers is a fundamental task in the pr...
research
02/25/2023

Mitigating Observation Biases in Crowdsourced Label Aggregation

Crowdsourcing has been widely used to efficiently obtain labeled dataset...
research
02/07/2022

Using Multiwinner Voting to Search for Movies

We show a prototype of a system that uses multiwinner voting to suggest ...
research
06/08/2019

Doubly Robust Crowdsourcing

Large-scale labeled datasets are the indispensable fuel that ignites the...
research
02/19/2015

Approval Voting and Incentives in Crowdsourcing

The growing need for labeled training data has made crowdsourcing an imp...
research
10/26/2017

Optimal Crowdsourced Classification with a Reject Option in the Presence of Spammers

We explore the design of an effective crowdsourcing system for an M-ary ...
research
10/21/2015

Time-Sensitive Bayesian Information Aggregation for Crowdsourcing Systems

Crowdsourcing systems commonly face the problem of aggregating multiple ...

Please sign up or login with your details

Forgot password? Click here to reset