Error Rate Bounds in Crowdsourcing Models

07/10/2013
by   Hongwei Li, et al.
0

Crowdsourcing is an effective tool for human-powered computation on many tasks challenging for computers. In this paper, we provide finite-sample exponential bounds on the error rate (in probability and in expectation) of hyperplane binary labeling rules under the Dawid-Skene crowdsourcing model. The bounds can be applied to analyze many common prediction methods, including the majority voting and weighted majority voting. These bound results could be useful for controlling the error rate and designing better algorithms. We show that the oracle Maximum A Posterior (MAP) rule approximately optimizes our upper bound on the mean error rate for any hyperplane binary labeling rule, and propose a simple data-driven weighted majority voting (WMV) rule (called one-step WMV) that attempts to approximate the oracle MAP and has a provable theoretical guarantee on the error rate. Moreover, we use simulated and real data to demonstrate that the data-driven EM-MAP rule is a good approximation to the oracle MAP rule, and to demonstrate that the mean error rate of the data-driven EM-MAP rule is also bounded by the mean error rate bound of the oracle MAP rule with estimated parameters plugging into the bound.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/15/2014

Error Rate Bounds and Iterative Weighted Majority Voting for Crowdsourcing

Crowdsourcing has become an effective and popular tool for human-powered...
research
09/29/2021

Error rate control for classification rules in multiclass mixture models

In the context of finite mixture models one considers the problem of cla...
research
01/17/2022

Adjudication with Rational Jurors

We analyze a mechanism for adjudication involving majority voting and ra...
research
02/23/2016

A Streaming Algorithm for Crowdsourced Data Classification

We propose a streaming algorithm for the binary classification of data b...
research
09/18/2023

New Bounds on the Accuracy of Majority Voting for Multi-Class Classification

Majority voting is a simple mathematical function that returns the value...
research
02/13/2018

Analysis of Minimax Error Rate for Crowdsourcing and Its Application to Worker Clustering Model

While crowdsourcing has become an important means to label data, crowdwo...
research
07/10/2023

Beyond the Two-Trials Rule

The two-trials rule for drug approval requires "at least two adequate an...

Please sign up or login with your details

Forgot password? Click here to reset