The Lab vs The Crowd: An Investigation into Data Quality for Neural Dialogue Models

12/07/2020
by   José Lopes, et al.
0

Challenges around collecting and processing quality data have hampered progress in data-driven dialogue models. Previous approaches are moving away from costly, resource-intensive lab settings, where collection is slow but where the data is deemed of high quality. The advent of crowd-sourcing platforms, such as Amazon Mechanical Turk, has provided researchers with an alternative cost-effective and rapid way to collect data. However, the collection of fluid, natural spoken or textual interaction can be challenging, particularly between two crowd-sourced workers. In this study, we compare the performance of dialogue models for the same interaction task but collected in two different settings: in the lab vs. crowd-sourced. We find that fewer lab dialogues are needed to reach similar accuracy, less than half the amount of lab data as crowd-sourced data. We discuss the advantages and disadvantages of each data collection method.

READ FULL TEXT
research
01/10/2018

Exploring Stereotypes and Biased Data with the Crowd

The goal of our research is to contribute information about how useful t...
research
02/14/2022

ArgSciChat: A Dataset for Argumentative Dialogues on Scientific Papers

The applications of conversational agents for scientific disciplines (as...
research
05/17/2019

Comparison-Based Framework for Psychophysics: Lab versus Crowdsourcing

Traditionally, psychophysical experiments are conducted by repeated meas...
research
07/18/2018

Robot Learning in Homes: Improving Generalization and Reducing Dataset Bias

Data-driven approaches to solving robotic tasks have gained a lot of tra...
research
03/17/2019

TurkScanner: Predicting the Hourly Wage of Microtasks

Workers in crowd markets struggle to earn a living. One reason for this ...
research
07/08/2021

Crowd Sensing and Living Lab Outdoor Experimentation Made Easy

Living lab outdoor experimentation using pervasive computing provides ne...

Please sign up or login with your details

Forgot password? Click here to reset