Why Do Machine Learning Practitioners Still Use Manual Tuning? A Qualitative Study

03/03/2022
by   Niklas Hasebrook, et al.
0

Current advanced hyperparameter optimization (HPO) methods, such as Bayesian optimization, have high sampling efficiency and facilitate replicability. Nonetheless, machine learning (ML) practitioners (e.g., engineers, scientists) mostly apply less advanced HPO methods, which can increase resource consumption during HPO or lead to underoptimized ML models. Therefore, we suspect that practitioners choose their HPO method to achieve different goals, such as decrease practitioner effort and target audience compliance. To develop HPO methods that align with such goals, the reasons why practitioners decide for specific HPO methods must be unveiled and thoroughly understood. Because qualitative research is most suitable to uncover such reasons and find potential explanations for them, we conducted semi-structured interviews to explain why practitioners choose different HPO methods. The interviews revealed six principal practitioner goals (e.g., increasing model comprehension), and eleven key factors that impact decisions for HPO methods (e.g., available computing resources). We deepen the understanding about why practitioners decide for different HPO methods and outline recommendations for improvements of HPO methods by aligning them with practitioner goals.

READ FULL TEXT
research
10/06/2021

Machine Learning Practices Outside Big Tech: How Resource Constraints Challenge Responsible Development

Practitioners from diverse occupations and backgrounds are increasingly ...
research
02/23/2023

Addressing UX Practitioners' Challenges in Designing ML Applications: an Interactive Machine Learning Approach

UX practitioners face novel challenges when designing user interfaces fo...
research
04/12/2023

Angler: Helping Machine Translation Practitioners Prioritize Model Improvements

Machine learning (ML) models can fail in unexpected ways in the real wor...
research
02/24/2021

Practitioners' Perceptions of the Goals and Visual Explanations of Defect Prediction Models

Software defect prediction models are classifiers that are constructed f...
research
03/02/2023

Hyperparameter Tuning and Model Evaluation in Causal Effect Estimation

The performance of most causal effect estimators relies on accurate pred...
research
12/19/2018

Orchestrate: Infrastructure for Enabling Parallelism during Hyperparameter Optimization

Two key factors dominate the development of effective production grade m...
research
05/13/2022

Deconstructing NLG Evaluation: Evaluation Practices, Assumptions, and Their Implications

There are many ways to express similar things in text, which makes evalu...

Please sign up or login with your details

Forgot password? Click here to reset