Coding for Crowdsourced Classification with XOR Queries

06/25/2019
by James, et al.

This paper models the crowdsourced labeling/classification problem as a sparsely encoded source coding problem, in which each query answer, regarded as a code bit, is the XOR of a small number of labels, regarded as source information bits. We leverage the connections between this problem and well-studied codes with sparse representations for the channel coding problem to provide querying schemes that use an almost optimal number of queries, each involving only a constant number of labels. We also extend this scenario to the case where some workers can be unresponsive. For this case, we propose querying schemes in which each query involves only log n items, where n is the total number of items to be labeled. Furthermore, we consider the classification of two correlated labeling systems and provide two-stage querying schemes that use an almost optimal number of queries, each involving a constant number of labels.
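To make the query model concrete, the following is a minimal sketch of what an XOR query answer looks like, assuming binary labels and noiseless workers. The function names `xor_query` and `make_queries`, the uniformly random choice of items, and the parameters `n`, `m`, and `d` are illustrative assumptions only; they are not the paper's querying schemes, which are built from sparse codes.

```python
import random
from functools import reduce
from operator import xor


def xor_query(labels, subset):
    """Answer one query: the XOR of the binary labels at the given indices."""
    return reduce(xor, (labels[i] for i in subset), 0)


def make_queries(n, m, d, seed=0):
    """Draw m hypothetical queries, each involving d distinct items out of n.

    Uniform sampling is an assumption for illustration, not the paper's design.
    """
    rng = random.Random(seed)
    return [sorted(rng.sample(range(n), d)) for _ in range(m)]


# Source information bits: one binary label per item.
labels = [1, 0, 1, 1, 0, 0, 1, 0]

# Each query touches a constant number (d = 3) of labels.
queries = make_queries(n=len(labels), m=5, d=3)

# Code bits: one answer per query.
answers = [xor_query(labels, q) for q in queries]
```

Each entry of `answers` plays the role of one code bit, and the sparsity of the encoding corresponds to the small, fixed size of every query subset.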


research
03/31/2019

Semisupervised Clustering by Queries and Locally Encodable Source Coding

Source coding is the canonical problem of data compression in informatio...
research
11/19/2022

Comparison of different coding schemes for 1-bit ADC

This paper devotes to comparison of different coding schemes (various co...
research
07/12/2020

Efficient Labeling for Reachability in Digraphs

We consider labeling nodes of a directed graph for reachability queries....
research
01/31/2020

Crowdsourced Classification with XOR Queries: Fundamental Limits and An Efficient Algorithm

Crowdsourcing systems have emerged as an effective platform to label dat...
research
10/08/2022

Constrained Optimal Querying: Huffman Coding and Beyond

Huffman coding is well known to be useful in certain decision problems i...
research
01/11/2018

Privacy in Index Coding: Improved Bounds and Coding Schemes

It was recently observed in [1], that in index coding, learning the codi...
research
01/22/2020

Computing Similarity Queries for Correlated Gaussian Sources

Among many current data processing systems, the objectives are often not...
