Understanding Task Design Trade-offs in Crowdsourced Paraphrase Collection

04/19/2017
by   Youxuan Jiang, et al.
0

Linguistically diverse datasets are critical for training and evaluating robust machine learning systems, but data collection is a costly process that often requires experts. Crowdsourcing the process of paraphrase generation is an effective means of expanding natural language datasets, but there has been limited analysis of the trade-offs that arise when designing tasks. In this paper, we present the first systematic study of the key factors in crowdsourcing paraphrase collection. We consider variations in instructions, incentives, data domains, and workflows. We manually analyzed paraphrases for correctness, grammaticality, and linguistic diversity. Our observations provide new insight into the trade-offs between accuracy and diversity in crowd responses that arise as a result of task design, providing guidance for future paraphrase generation procedures.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/09/2021

Trade-offs in the Design of Multimodal Interaction for Older Adults

This paper presents key aspects and trade-offs that designers and Human-...
research
05/12/2022

On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data

Borrowing ideas from Production functions in micro-economics, in this pa...
research
12/10/2017

Analysis-of-marginal-Tail-Means - a new method for robust parameter optimization

This paper presents a novel method, called Analysis-of-marginal-Tail-Mea...
research
09/27/2021

The Forgotten Preconditions for a Well-Functioning Internet

For decades, proponents of the Internet have promised that it would one ...
research
02/05/2019

An Exploratory Study on Visual Exploration of Model Simulations by Multiple Types of Experts

Experts in different domains rely increasingly on simulation models of c...
research
09/18/2019

Diversity-enabled sweet spots in layered architectures and speed-accuracy trade-offs in sensorimotor control

Nervous systems sense, communicate, compute, and actuate movement using ...
research
12/10/2019

Form + Function: Optimizing Aesthetic Product Design via Adaptive, Geometrized Preference Elicitation

Visual design is critical to product success, and the subject of intensi...

Please sign up or login with your details

Forgot password? Click here to reset