Needle in a Haystack: An Analysis of Finding Qualified Workers on MTurk for Summarization

12/20/2022

∙

The acquisition of high-quality human annotations through crowdsourcing platforms like Amazon Mechanical Turk (MTurk) is more challenging than expected. The annotation quality might be affected by various aspects like annotation instructions, Human Intelligence Task (HIT) design, and wages paid to annotators, etc. To avoid potentially low-quality annotations which could mislead the evaluation of automatic summarization system outputs, we investigate the recruitment of high-quality MTurk workers via a three-step qualification pipeline. We show that we can successfully filter out bad workers before they carry out the evaluations and obtain high-quality annotations while optimizing the use of resources. This paper can serve as basis for the recruitment of qualified annotators in other challenging annotation tasks.

READ FULL TEXT

Needle in a Haystack: An Analysis of Finding Qualified Workers on MTurk for Summarization

Sign in with Google

Consider DeepAI Pro