Crowdsourcing Utilizing Subgroup Structure of Latent Factor Modeling

02/05/2023
by   Qi Xu, et al.
0

Crowdsourcing has emerged as an alternative solution for collecting large scale labels. However, the majority of recruited workers are not domain experts, so their contributed labels could be noisy. In this paper, we propose a two-stage model to predict the true labels for multicategory classification tasks in crowdsourcing. In the first stage, we fit the observed labels with a latent factor model and incorporate subgroup structures for both tasks and workers through a multi-centroid grouping penalty. Group-specific rotations are introduced to align workers with different task categories to solve multicategory crowdsourcing tasks. In the second stage, we propose a concordance-based approach to identify high-quality worker subgroups who are relied upon to assign labels to tasks. In theory, we show the estimation consistency of the latent factors and the prediction consistency of the proposed method. The simulation studies show that the proposed method outperforms the existing competitive methods, assuming the subgroup structures within tasks and workers. We also demonstrate the application of the proposed method to real world problems and show its superiority.

READ FULL TEXT

page 17

page 30

research
02/26/2018

Millionaire: A Hint-guided Approach for Crowdsourcing

Modern machine learning is migrating to the era of complex models, which...
research
06/01/2020

Variational Bayesian Inference for Crowdsourcing Predictions

Crowdsourcing has emerged as an effective means for performing a number ...
research
02/14/2016

Embracing Error to Enable Rapid Crowdsourcing

Microtask crowdsourcing has enabled dataset advances in social science a...
research
02/14/2023

A Provably Improved Algorithm for Crowdsourcing with Hard and Easy Tasks

Crowdsourcing is a popular method used to estimate ground-truth labels b...
research
08/05/2023

Crowdsourcing Fraud Detection over Heterogeneous Temporal MMMA Graph

The rise of the click farm business using Multi-purpose Messaging Mobile...
research
01/04/2017

Probabilistic Multigraph Modeling for Improving the Quality of Crowdsourced Affective Data

We proposed a probabilistic approach to joint modeling of participants' ...
research
02/10/2016

Feature Based Task Recommendation in Crowdsourcing with Implicit Observations

Existing research in crowdsourcing has investigated how to recommend tas...

Please sign up or login with your details

Forgot password? Click here to reset