Open-Set Crowdsourcing using Multiple-Source Transfer Learning

11/07/2021
by   Guangyang Han, et al.
0

We raise and define a new crowdsourcing scenario, open set crowdsourcing, where we only know the general theme of an unfamiliar crowdsourcing project, and we don't know its label space, that is, the set of possible labels. This is still a task annotating problem, but the unfamiliarity with the tasks and the label space hampers the modelling of the task and of workers, and also the truth inference. We propose an intuitive solution, OSCrowd. First, OSCrowd integrates crowd theme related datasets into a large source domain to facilitate partial transfer learning to approximate the label space inference of these tasks. Next, it assigns weights to each source domain based on category correlation. After this, it uses multiple-source open set transfer learning to model crowd tasks and assign possible annotations. The label space and annotations given by transfer learning will be used to guide and standardize crowd workers' annotations. We validate OSCrowd in an online scenario, and prove that OSCrowd solves the open set crowdsourcing problem, works better than related crowdsourcing solutions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/24/2019

Attention-Aware Answers of the Crowd

Crowdsourcing is a relatively economic and efficient solution to collect...
research
03/30/2017

Evaluating Complex Task through Crowdsourcing: Multiple Views Approach

With the popularity of massive open online courses, grading through crow...
research
06/23/2018

Optimizing the Wisdom of the Crowd: Inference, Learning, and Teaching

The unprecedented demand for large amount of data has catalyzed the tren...
research
08/07/2017

T-Crowd: Effective Crowdsourcing for Tabular Data

Crowdsourcing employs human workers to solve computer-hard problems, suc...
research
04/17/2018

Unlearn What You Have Learned: Adaptive Crowd Teaching with Exponentially Decayed Memory Learners

With the increasing demand for large amount of labeled data, crowdsourci...
research
05/01/2018

Capturing Ambiguity in Crowdsourcing Frame Disambiguation

FrameNet is a computational linguistics resource composed of semantic fr...
research
03/07/2023

Crowdsourcing in Precision Healthcare: Short Review

The age of deep learning has brought high-performing diagnostic models f...

Please sign up or login with your details

Forgot password? Click here to reset