An Efficient Framework for Monitoring Subgroup Performance of Machine Learning Systems

12/16/2022
by Huong Ha, et al.

Monitoring machine learning systems after deployment is critical to ensuring their reliability. Of particular importance is the problem of monitoring the performance of machine learning systems across all data subgroups (subpopulations). In practice, this process can be prohibitively expensive: the number of data subgroups grows exponentially with the number of input features, and labelling data to evaluate each subgroup's performance is costly. In this paper, we propose an efficient framework for monitoring the subgroup performance of machine learning systems. Specifically, we aim to find the data subgroup with the worst performance using a limited amount of labeled data. We formulate this task mathematically as an optimization problem with an expensive black-box objective function, and propose using Bayesian optimization to solve it. Our experimental results on various real-world datasets and machine learning systems show that the proposed framework can retrieve the worst-performing data subgroup effectively and efficiently.
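To make the formulation concrete, here is a minimal sketch of a Bayesian-optimization search for the worst-performing subgroup under a labelling budget. It is not the authors' implementation: the abstract does not specify the surrogate model or acquisition function, so the Gaussian-process surrogate with an RBF kernel over binary subgroup encodings, the lower-confidence-bound acquisition rule, and all names (find_worst_subgroup, evaluate, toy_accuracy, kappa) are assumptions made for illustration. The evaluate callback stands in for the expensive step of labelling samples from a subgroup and measuring the system's accuracy on them.

```python
import itertools
import numpy as np


def rbf_kernel(A, B, length_scale=1.0):
    # Pairwise squared Euclidean distances between the rows of A and B.
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / length_scale ** 2)


def gp_posterior(X_obs, y_obs, X_all, noise=1e-2):
    # Gaussian-process posterior mean and std at X_all (assumed surrogate).
    y_mean, y_std = y_obs.mean(), y_obs.std() + 1e-9
    z = (y_obs - y_mean) / y_std  # standardise the observed scores
    K = rbf_kernel(X_obs, X_obs) + noise * np.eye(len(X_obs))
    K_s = rbf_kernel(X_all, X_obs)
    mu = y_mean + y_std * (K_s @ np.linalg.solve(K, z))
    var = 1.0 - np.einsum("ij,ji->i", K_s, np.linalg.solve(K, K_s.T))
    return mu, y_std * np.sqrt(np.clip(var, 1e-12, None))


def find_worst_subgroup(subgroups, evaluate, budget, n_init=3, kappa=2.0, seed=0):
    # subgroups: list of binary feature vectors, one per candidate subgroup.
    # evaluate:  hypothetical expensive black box returning a performance score.
    # budget:    maximum number of subgroups we can afford to evaluate.
    rng = np.random.default_rng(seed)
    X_all = np.asarray(subgroups, dtype=float)
    tried = [int(i) for i in rng.choice(len(X_all), size=n_init, replace=False)]
    scores = [evaluate(subgroups[i]) for i in tried]
    while len(tried) < budget:
        mu, sigma = gp_posterior(X_all[tried], np.array(scores), X_all)
        lcb = mu - kappa * sigma   # optimistic guess of how low each score could be
        lcb[tried] = np.inf        # never re-evaluate a subgroup
        nxt = int(np.argmin(lcb))
        tried.append(nxt)
        scores.append(evaluate(subgroups[nxt]))
    best = int(np.argmin(scores))
    return subgroups[tried[best]], scores[best], len(tried)


# Toy setup (synthetic, for illustration only): 4 binary features, 16 subgroups.
subgroups = [list(bits) for bits in itertools.product([0, 1], repeat=4)]


def toy_accuracy(g):
    # Made-up accuracy surface whose unique minimum is the subgroup [1, 1, 1, 1].
    return 0.9 - 0.1 * g[0] - 0.15 * g[2] - 0.2 * g[0] * g[2] - 0.02 * g[1] - 0.03 * g[3]


worst, score, n_evals = find_worst_subgroup(subgroups, toy_accuracy, budget=8)
print(worst, round(score, 3), n_evals)
```

With a budget smaller than the number of subgroups, the search returns the worst subgroup among those it evaluated, which is not guaranteed to be the global worst. The parameter kappa trades off exploring uncertain subgroups against exploiting those the surrogate already predicts to perform poorly.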


