Crowdsourced Classification with XOR Queries: Fundamental Limits and An Efficient Algorithm

01/31/2020
by   Daesung Kim, et al.
12

Crowdsourcing systems have emerged as an effective platform to label data and classify objects with relatively low cost by exploiting non-expert workers. To ensure reliable recovery of unknown labels with as few number of queries as possible, we consider an effective query type that asks "group attribute" of a chosen subset of objects. In particular, we consider the problem of classifying m binary labels with XOR queries that ask whether the number of objects having a given attribute in the chosen subset of size d is even or odd. The subset size d, which we call query degree, can be varying over queries. Since a worker needs to make more efforts to answer a query of a higher degree, we consider a noise model where the accuracy of worker's answer changes depending both on the worker reliability and query degree d. For this general model, we characterize the information-theoretic limit on the optimal number of queries to reliably recover m labels in terms of a given combination of degree-d queries and noise parameters. Further, we propose an efficient inference algorithm that achieves this limit even when the noise parameters are unknown.

READ FULL TEXT

page 3

page 4

page 6

page 7

page 12

page 13

page 21

page 23

research
11/19/2021

A Worker-Task Specialization Model for Crowdsourcing: Efficient Inference and Fundamental Limits

Crowdsourcing system has emerged as an effective platform to label data ...
research
03/21/2020

Crowdsourced Labeling for Worker-Task Specialization Block Model

We consider crowdsourced labeling under a worker-task specialization blo...
research
09/04/2018

Parity Crowdsourcing for Cooperative Labeling

Consider a database of k objects, e.g., a set of videos, where each obje...
research
11/24/2009

Group-based Query Learning for rapid diagnosis in time-critical situations

In query learning, the goal is to identify an unknown object while minim...
research
06/25/2019

Coding for Crowdsourced Classification with XOR Queries

This paper models the crowdsourced labeling/classification problem as a ...
research
03/31/2015

Crowdsourcing Feature Discovery via Adaptively Chosen Comparisons

We introduce an unsupervised approach to efficiently discover the underl...
research
12/04/2021

Efficient Deterministic Quantitative Group Testing for Precise Information Retrieval

The Quantitative Group Testing (QGT) is about learning a (hidden) subset...

Please sign up or login with your details

Forgot password? Click here to reset