Angler: Helping Machine Translation Practitioners Prioritize Model Improvements

04/12/2023
by   Samantha Robertson, et al.
0

Machine learning (ML) models can fail in unexpected ways in the real world, but not all model failures are equal. With finite time and resources, ML practitioners are forced to prioritize their model debugging and improvement efforts. Through interviews with 13 ML practitioners at Apple, we found that practitioners construct small targeted test sets to estimate an error's nature, scope, and impact on users. We built on this insight in a case study with machine translation models, and developed Angler, an interactive visual analytics tool to help practitioners prioritize model improvements. In a user study with 7 machine translation experts, we used Angler to understand prioritization practices when the input space is infinite, and obtaining reliable signals of model quality is expensive. Our study revealed that participants could form more interesting and user-focused hypotheses for prioritization by analyzing quantitative summary statistics and qualitatively assessing data by reading sentences.

READ FULL TEXT

page 1

page 9

page 11

page 12

research
02/23/2023

Addressing UX Practitioners' Challenges in Designing ML Applications: an Interactive Machine Learning Approach

UX practitioners face novel challenges when designing user interfaces fo...
research
08/28/2022

An Empirical Study on the Usage of Automated Machine Learning Tools

The popularity of automated machine learning (AutoML) tools in different...
research
07/09/2019

The What-If Tool: Interactive Probing of Machine Learning Models

A key challenge in developing and deploying Machine Learning (ML) system...
research
03/03/2022

Why Do Machine Learning Practitioners Still Use Manual Tuning? A Qualitative Study

Current advanced hyperparameter optimization (HPO) methods, such as Baye...
research
09/19/2021

An Exploration And Validation of Visual Factors in Understanding Classification Rule Sets

Rule sets are often used in Machine Learning (ML) as a way to communicat...
research
04/30/2023

SoK: Pragmatic Assessment of Machine Learning for Network Intrusion Detection

Machine Learning (ML) has become a valuable asset to solve many real-wor...
research
02/09/2023

Zeno: An Interactive Framework for Behavioral Evaluation of Machine Learning

Machine learning models with high accuracy on test data can still produc...

Please sign up or login with your details

Forgot password? Click here to reset