Operationalizing Machine Learning: An Interview Study

09/16/2022
by   Shreya Shankar, et al.
9

Organizations rely on machine learning engineers (MLEs) to operationalize ML, i.e., deploy and maintain ML pipelines in production. The process of operationalizing ML, or MLOps, consists of a continual loop of (i) data collection and labeling, (ii) experimentation to improve ML performance, (iii) evaluation throughout a multi-staged deployment process, and (iv) monitoring of performance drops in production. When considered together, these responsibilities seem staggering – how does anyone do MLOps, what are the unaddressed challenges, and what are the implications for tool builders? We conducted semi-structured ethnographic interviews with 18 MLEs working across many applications, including chatbots, autonomous vehicles, and finance. Our interviews expose three variables that govern success for a production ML deployment: Velocity, Validation, and Versioning. We summarize common practices for successful ML experimentation, deployment, and sustaining production performance. Finally, we discuss interviewees' pain points and anti-patterns, with implications for tool design.

READ FULL TEXT
research
03/30/2021

Production Machine Learning Pipelines: Empirical Analysis and Optimization Opportunities

Machine learning (ML) is now commonplace, powering data-driven applicati...
research
10/14/2021

Looper: An end-to-end ML platform for product decisions

Modern software systems and products increasingly rely on machine learni...
research
05/08/2021

Chameleon: A Semi-AutoML framework targeting quick and scalable development and deployment of production-ready ML systems for SMEs

Developing, scaling, and deploying modern Machine Learning solutions rem...
research
12/04/2018

Expanding search in the space of empirical ML

As researchers and practitioners of applied machine learning, we are giv...
research
03/10/2023

Moving Fast With Broken Data

Machine learning (ML) models in production pipelines are frequently retr...
research
07/11/2022

Documenting Data Production Processes: A Participatory Approach for Data Work

The opacity of machine learning data is a significant threat to ethical ...
research
09/07/2021

Amazon SageMaker Clarify: Machine Learning Bias Detection and Explainability in the Cloud

Understanding the predictions made by machine learning (ML) models and t...

Please sign up or login with your details

Forgot password? Click here to reset