Analysis of Minimax Error Rate for Crowdsourcing and Its Application to Worker Clustering Model

02/13/2018
by Hideaki Imamura, et al.

While crowdsourcing has become an important means of labeling data, crowdworkers are not always experts, and some can even be adversarial. There is therefore great interest in estimating the ground truth from the unreliable labels that crowdworkers produce. The Dawid and Skene (DS) model is one of the best-known models in the study of crowdsourcing. Despite its practical popularity, theoretical error analysis for the DS model has been conducted only under restrictive assumptions on, e.g., class priors, confusion matrices, and the number of labels each worker provides. In this paper, we derive a minimax error rate under a more practical setting for a broader class of crowdsourcing models that includes the DS model as a special case. We further propose the worker clustering model, which is more practical than the DS model under real crowdsourcing settings. The wide applicability of our theoretical analysis allows us to immediately investigate the behavior of this proposed model. Experimental results show that the lower bound of the minimax error rate derived by our theoretical analysis is strongly similar to the empirical error of the estimated value.

research
03/25/2015

Regularized Minimax Conditional Entropy for Crowdsourcing

There is a rapidly increasing interest in crowdsourcing for data labelin...
research
06/01/2016

A Minimax Optimal Algorithm for Crowdsourcing

We consider the problem of accurately estimating the reliability of work...
research
11/15/2014

Error Rate Bounds and Iterative Weighted Majority Voting for Crowdsourcing

Crowdsourcing has become an effective and popular tool for human-powered...
research
09/10/2017

Rates of Convergence of Spectral Methods for Graphon Estimation

This paper studies the problem of estimating the graphon model - the und...
research
05/19/2019

Teaching decision theory proof strategies using a crowdsourcing problem

Teaching how to derive minimax decision rules can be challenging because...
research
10/19/2015

Stochastically Transitive Models for Pairwise Comparisons: Statistical and Computational Issues

There are various parametric models for analyzing pairwise comparison da...
research
07/10/2013

Error Rate Bounds in Crowdsourcing Models

Crowdsourcing is an effective tool for human-powered computation on many...
