Towards Ecologically Valid Research on Language User Interfaces

07/28/2020
by   Harm de Vries, et al.
0

Language User Interfaces (LUIs) could improve human-machine interaction for a wide variety of tasks, such as playing music, getting insights from databases, or instructing domestic robots. In contrast to traditional hand-crafted approaches, recent work attempts to build LUIs in a data-driven way using modern deep learning methods. To satisfy the data needs of such learning algorithms, researchers have constructed benchmarks that emphasize the quantity of collected data at the cost of its naturalness and relevance to real-world LUI use cases. As a consequence, research findings on such benchmarks might not be relevant for developing practical LUIs. The goal of this paper is to bootstrap the discussion around this issue, which we refer to as the benchmarks' low ecological validity. To this end, we describe what we deem an ideal methodology for machine learning research on LUIs and categorize five common ways in which recent benchmarks deviate from it. We give concrete examples of the five kinds of deviations and their consequences. Lastly, we offer a number of recommendations as to how to increase the ecological validity of machine learning research on LUIs.

READ FULL TEXT

page 2

page 5

page 6

research
01/04/2023

Validity in Music Information Research Experiments

Validity is the truth of an inference made from evidence, such as data c...
research
04/16/2019

HARK Side of Deep Learning -- From Grad Student Descent to Automated Machine Learning

Recent advancements in machine learning research, i.e., deep learning, i...
research
02/03/2021

Insiders and Outsiders in Research on Machine Learning and Society

A subset of machine learning research intersects with societal issues, i...
research
11/12/2018

Recent Research Advances on Interactive Machine Learning

Interactive Machine Learning (IML) is an iterative learning process that...
research
03/20/2018

Why not be Versatile? Applications of the SGNMT Decoder for Machine Translation

SGNMT is a decoding platform for machine translation which allows paring...
research
11/16/2021

HiRID-ICU-Benchmark – A Comprehensive Machine Learning Benchmark on High-resolution ICU Data

The recent success of machine learning methods applied to time series co...
research
07/13/2018

Deep Learning in the Wild

Deep learning with neural networks is applied by an increasing number of...

Please sign up or login with your details

Forgot password? Click here to reset