Reasonable Scale Machine Learning with Open-Source Metaflow

03/21/2023
by Jacopo Tagliabue, et al.

As Machine Learning (ML) gains adoption across industries and new use cases, practitioners increasingly realize the challenges around effectively developing and iterating on ML systems: reproducibility, debugging, scalability, and documentation are elusive goals for real-world pipelines outside tech-first companies. In this paper, we review the nature of ML-oriented workloads and argue that re-purposing existing tools won't solve the current productivity issues, as ML peculiarities warrant specialized development tooling. We then introduce Metaflow, an open-source framework for ML projects explicitly designed to boost the productivity of data practitioners by abstracting away the execution of ML code from the definition of the business logic. We show how our design addresses the main challenges in ML operations (MLOps), and document through examples, interviews and use cases its practical impact on the field.

