Model Assertions for Monitoring and Improving ML Model

03/03/2020
by   Daniel Kang, et al.
16

ML models are increasingly deployed in settings with real world interactions such as vehicles, but unfortunately, these models can fail in systematic ways. To prevent errors, ML engineering teams monitor and continuously improve these models. We propose a new abstraction, model assertions, that adapts the classical use of program assertions as a way to monitor and improve ML models. Model assertions are arbitrary functions over a model's input and output that indicate when errors may be occurring, e.g., a function that triggers if an object rapidly changes its class in a video. We propose methods of using model assertions at all stages of ML system deployment, including runtime monitoring, validating labels, and continuously improving ML models. For runtime monitoring, we show that model assertions can find high confidence errors, where a model returns the wrong output with high confidence, which uncertainty-based monitoring techniques would not detect. For training, we propose two methods of using model assertions. First, we propose a bandit-based active learning algorithm that can sample from data flagged by assertions and show that it can reduce labeling costs by up to 40 uncertainty-based methods. Second, we propose an API for generating "consistency assertions" (e.g., the class change example) and weak labels for inputs where the consistency assertions fail, and show that these weak labels can improve relative model quality by up to 46%. We evaluate model assertions on four real-world tasks with video, LIDAR, and ECG data.

READ FULL TEXT

page 2

page 14

research
03/03/2020

Model Assertions for Monitoring and Improving ML Models

ML models are increasingly deployed in settings with real world interact...
research
01/15/2022

Finding Label and Model Errors in Perception Data With Learned Observation Assertions

ML is being deployed in complex, real-world scenarios where errors have ...
research
02/07/2019

ML Health: Fitness Tracking for Production Models

Deployment of machine learning (ML) algorithms in production for extende...
research
07/29/2021

Did the Model Change? Efficiently Assessing Machine Learning API Shifts

Machine learning (ML) prediction APIs are increasingly widely used. An M...
research
04/28/2021

MLDemon: Deployment Monitoring for Machine Learning Systems

Post-deployment monitoring of the performance of ML systems is critical ...
research
10/06/2021

Tribuo: Machine Learning with Provenance in Java

Machine Learning models are deployed across a wide range of industries, ...
research
04/15/2021

Compressive time-lapse seismic monitoring of carbon storage and sequestration with the joint recovery model

Time-lapse seismic monitoring of carbon storage and sequestration is oft...

Please sign up or login with your details

Forgot password? Click here to reset