Wizard of Errors: Introducing and Evaluating Machine Learning Errors in Wizard of Oz Studies

02/17/2023
by Anniek Jansen, et al.

When designing Machine Learning (ML)-enabled solutions, designers often need to simulate ML behavior through the Wizard of Oz (WoZ) approach to test the user experience before the ML model is available. Although reproducing ML errors is essential for a good representation of ML behavior, such errors are rarely considered. We introduce Wizard of Errors (WoE), a tool for conducting WoZ studies on ML-enabled solutions that allows simulating ML errors during user experience assessment. We explored how this system can be used to simulate the behavior of a computer vision model. We tested WoE with design students to determine the importance of considering ML errors in design, the relevance of using descriptive error types instead of a confusion matrix, and the suitability of manual error control in WoZ studies. Our work identifies several challenges that prevent realistic error representation by designers in such studies. We discuss the implications of these findings for design.


