Mitigating Observation Biases in Crowdsourced Label Aggregation

02/25/2023
by   Ryosuke Ueda, et al.
0

Crowdsourcing has been widely used to efficiently obtain labeled datasets for supervised learning from large numbers of human resources at low cost. However, one of the technical challenges in obtaining high-quality results from crowdsourcing is dealing with the variability and bias caused by the fact that it is humans execute the work, and various studies have addressed this issue to improve the quality by integrating redundantly collected responses. In this study, we focus on the observation bias in crowdsourcing. Variations in the frequency of worker responses and the complexity of tasks occur, which may affect the aggregation results when they are correlated with the quality of the responses. We also propose statistical aggregation methods for crowdsourcing responses that are combined with an observational data bias removal method used in causal inference. Through experiments using both synthetic and real datasets with/without artificially injected spam and colluding workers, we verify that the proposed method improves the aggregation accuracy in the presence of strong observation biases and robustness to both spam and colluding workers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/11/2021

Full Characterization of Adaptively Strong Majority Voting in Crowdsourcing

A commonly used technique for quality control in crowdsourcing is to tas...
research
01/04/2017

Probabilistic Multigraph Modeling for Improving the Quality of Crowdsourced Affective Data

We proposed a probabilistic approach to joint modeling of participants' ...
research
12/05/2018

A Technical Survey on Statistical Modelling and Design Methods for Crowdsourcing Quality Control

Online crowdsourcing provides a scalable and inexpensive means to collec...
research
05/17/2019

MiSC: Mixed Strategies Crowdsourcing

Popular crowdsourcing techniques mostly focus on evaluating workers' lab...
research
10/07/2021

Detecting adversaries in Crowdsourcing

Despite its successes in various machine learning and data science tasks...
research
07/21/2017

Autocompletion interfaces make crowd workers slower, but their use promotes response diversity

Creative tasks such as ideation or question proposal are powerful applic...
research
11/19/2022

Quantifying Human Bias and Knowledge to guide ML models during Training

This paper discusses a crowdsourcing based method that we designed to qu...

Please sign up or login with your details

Forgot password? Click here to reset