Fast Dawid-Skene

03/07/2018
by   Vaibhav Sinha, et al.

Many real-world problems can now be solved effectively using supervised machine learning. A major roadblock is often the lack of sufficient labeled data for training. One possible solution is to assign the labeling task to a crowd and then infer the true labels using aggregation methods. A well-known aggregation approach is the Dawid-Skene (DS) algorithm, which is based on the principle of Expectation-Maximization (EM). We propose a new simple, yet effective, EM-based algorithm that can be interpreted as a 'hard' version of DS and that converges much faster while maintaining similar aggregation accuracy. We also show how the proposed method can be extended to settings with multiple labels and to online vote aggregation. Our experiments on standard vote aggregation datasets show a significant speedup in the time taken to converge, up to ∼8x over Dawid-Skene and ∼6x over other fast EM methods, at competitive accuracy.
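
To make the idea of a 'hard' EM variant of Dawid-Skene concrete, here is a minimal sketch in plain Python. It is not the authors' implementation. It assumes one common reading of "hard": after each E-step, every item is assigned to its single most likely class instead of keeping a full posterior. The function name `hard_dawid_skene`, the `(item, worker, label)` vote format, majority-vote initialisation, and the `smoothing` constant are all illustrative assumptions.

```python
import math
from collections import defaultdict


def hard_dawid_skene(votes, num_classes, max_iters=50, smoothing=0.01):
    """Aggregate crowd votes with a hard-assignment EM loop (illustrative sketch).

    votes: iterable of (item, worker, label) with label in range(num_classes).
    Returns a dict mapping each item to its estimated class.
    """
    by_item = defaultdict(list)
    workers = set()
    for item, worker, label in votes:
        by_item[item].append((worker, label))
        workers.add(worker)
    items = sorted(by_item)

    # Assumed initialisation: majority vote, ties go to the smallest class index.
    labels = {}
    for item in items:
        counts = [0] * num_classes
        for _, label in by_item[item]:
            counts[label] += 1
        labels[item] = counts.index(max(counts))

    for _ in range(max_iters):
        # M-step: class priors from the current hard labels (with smoothing).
        prior = [smoothing] * num_classes
        for item in items:
            prior[labels[item]] += 1
        total = sum(prior)
        prior = [p / total for p in prior]

        # M-step: per-worker confusion matrices, rows indexed by assumed true class.
        confusion = {
            w: [[smoothing] * num_classes for _ in range(num_classes)]
            for w in workers
        }
        for item in items:
            for worker, label in by_item[item]:
                confusion[worker][labels[item]][label] += 1
        for matrix in confusion.values():
            for row in matrix:
                row_sum = sum(row)
                for k in range(num_classes):
                    row[k] /= row_sum

        # Hard E-step: keep only the most likely class for each item.
        new_labels = {}
        for item in items:
            scores = []
            for c in range(num_classes):
                score = math.log(prior[c])
                for worker, label in by_item[item]:
                    score += math.log(confusion[worker][c][label])
                scores.append(score)
            new_labels[item] = scores.index(max(scores))

        # With hard labels, convergence is simply "no label changed".
        if new_labels == labels:
            break
        labels = new_labels

    return labels
```

Because the labels are discrete, the loop can stop as soon as an iteration leaves every label unchanged. That is one plausible intuition for why a hard variant might converge in fewer iterations than standard DS, which tracks small changes in continuous posteriors. The speedups quoted in the abstract are the paper's own measurements and are not reproduced by this sketch.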


