LeanML: A Design Pattern To Slash Avoidable Wastes in Machine Learning Projects

07/16/2021
by Yves-Laurent Kom Samo, et al.

We introduce the first application of the lean methodology to machine learning projects. As with lean startups and lean manufacturing, we argue that lean machine learning (LeanML) can drastically cut avoidable waste in commercial machine learning projects, reduce the business risk of investing in machine learning capabilities and, in so doing, further democratize access to machine learning. The lean design pattern we propose in this paper is based on two realizations. First, it is possible to estimate the best performance one may achieve when predicting an outcome y ∈ 𝒴 using a given set of explanatory variables x ∈ 𝒳, for a wide range of performance metrics, and without training any predictive model. Second, doing so is considerably easier, faster, and cheaper than learning the best predictive model. We derive formulae expressing the best R^2, MSE, classification accuracy, and log-likelihood per observation achievable when using x to predict y as a function of the mutual information I(y; x), and possibly a measure of the variability of y (e.g. its Shannon entropy in the case of classification accuracy, and its variance in the case of regression MSE). We illustrate the efficacy of the LeanML design pattern on a wide range of regression and classification problems, both synthetic and real-life.
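The abstract does not reproduce the formulae themselves, so the following is only a minimal sketch of the idea that performance ceilings can be computed from I(y; x) without fitting a model. It relies on two standard facts rather than on the paper's own derivations: for jointly Gaussian (x, y) with correlation ρ, I(y; x) = -½ ln(1 - ρ²), hence R² = 1 - exp(-2 I); and Fano's inequality, which bounds the accuracy of any classifier in terms of H(y) - I(y; x). All function names are hypothetical, and mutual information is assumed to be given in nats.

```python
import math

def gaussian_mi(rho: float) -> float:
    """Mutual information (nats) of a bivariate Gaussian with correlation rho."""
    return -0.5 * math.log(1.0 - rho ** 2)

def r2_ceiling(mi: float) -> float:
    """R^2 implied by mutual information under the Gaussian assumption."""
    return 1.0 - math.exp(-2.0 * mi)

def mse_floor(mi: float, var_y: float) -> float:
    """Smallest MSE under the same assumption: Var(y) * (1 - R^2)."""
    return var_y * math.exp(-2.0 * mi)

def accuracy_ceiling(entropy_y: float, mi: float, q: int) -> float:
    """Upper bound on q-class accuracy from Fano's inequality (assumes q >= 2).

    Fano: H(y|x) <= H_b(pe) + pe * ln(q - 1), where pe is the error rate.
    The right-hand side is increasing on [0, (q-1)/q], so we invert it
    by bisection to get the smallest admissible error rate.
    """
    cond_entropy = max(entropy_y - mi, 0.0)  # H(y|x) = H(y) - I(y; x)
    if cond_entropy >= math.log(q):
        return 1.0 / q  # no better than uniform guessing

    def fano_rhs(pe: float) -> float:
        if pe <= 0.0:
            return 0.0
        binary_entropy = -pe * math.log(pe) - (1.0 - pe) * math.log(1.0 - pe)
        return binary_entropy + pe * math.log(q - 1)

    lo, hi = 0.0, (q - 1) / q
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if fano_rhs(mid) < cond_entropy:
            lo = mid
        else:
            hi = mid
    return 1.0 - lo  # lo never exceeds the minimal error rate

if __name__ == "__main__":
    mi = gaussian_mi(0.8)
    print("R^2 ceiling:", r2_ceiling(mi))
    print("MSE floor (Var(y) = 4):", mse_floor(mi, 4.0))
    print("Accuracy ceiling:", accuracy_ceiling(math.log(2), 0.3, q=2))
```

In practice the hard part is estimating I(y; x) from data, which this sketch does not attempt; it only shows how a mutual information estimate and a variability measure of y translate into achievable-performance figures.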

