Truth Inference on Sparse Crowdsourcing Data with Local Differential Privacy

08/24/2018
by   Haipei Sun, et al.
0

Crowdsourcing has arisen as a new problem-solving paradigm for tasks that are difficult for computers but easy for humans. However, since the answers collected from the recruited participants (workers) may contain sensitive information, crowdsourcing raises serious privacy concerns. In this paper, we investigate the problem of protecting answer privacy under local differential privacy (LDP), by which individual workers randomize their answers independently and send the perturbed answers to the task requester. The utility goal is to enable to infer the true answer (i.e., truth) from the perturbed data with high accuracy. One of the challenges of LDP perturbation is the sparsity of worker answers (i.e., each worker only answers a small number of tasks). Simple extension of the existing approaches (e.g., Laplace perturbation and randomized response) may incur large error of truth inference on sparse data. Thus we design an efficient new matrix factorization (MF) algorithm under LDP. We prove that our MF algorithm can provide both LDP guarantee and small error of truth inference, regardless of the sparsity of worker answers. We perform extensive experiments on real-world and synthetic datasets, and demonstrate that the MF algorithm performs better than the existing LDP algorithms on sparse crowdsourcing data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/07/2017

T-Crowd: Effective Crowdsourcing for Tabular Data

Crowdsourcing employs human workers to solve computer-hard problems, suc...
research
07/10/2020

From Task Tuning to Task Assignment in Privacy-Preserving Crowdsourcing Platforms

Specialized worker profiles of crowdsourcing platforms may contain a lar...
research
11/01/2014

How Many Workers to Ask? Adaptive Exploration for Collecting High Quality Labels

Crowdsourcing has been part of the IR toolbox as a cheap and fast mechan...
research
06/16/2018

Efficient Crowdsourcing via Proxy Voting

Crowdsourcing platforms offer a way to label data by aggregating answers...
research
05/02/2019

Truth Discovery via Proxy Voting

Truth discovery is a general name for a broad range of statistical metho...
research
08/20/2021

Privacy-Preserving Batch-based Task Assignment in Spatial Crowdsourcing with Untrusted Server

In this paper, we study the privacy-preserving task assignment in spatia...
research
02/28/2017

Iterative Bayesian Learning for Crowdsourced Regression

Crowdsourcing platforms emerged as popular venues for purchasing human i...

Please sign up or login with your details

Forgot password? Click here to reset