Well-Calibrated Probabilistic Predictive Maintenance using Venn-Abers

06/11/2023
by   Ulf Johansson, et al.
0

When using machine learning for fault detection, a common problem is the fact that most data sets are very unbalanced, with the minority class (a fault) being the interesting one. In this paper, we investigate the usage of Venn-Abers predictors, looking specifically at the effect on the minority class predictions. A key property of Venn-Abers predictors is that they output well-calibrated probability intervals. In the experiments, we apply Venn-Abers calibration to decision trees, random forests and XGBoost models, showing how both overconfident and underconfident models are corrected. In addition, the benefit of using the valid probability intervals produced by Venn-Abers for decision support is demonstrated. When using techniques producing opaque underlying models, e.g., random forest and XGBoost, each prediction will consist of not only the label, but also a valid probability interval, where the width is an indication of the confidence in the estimate. Adding Venn-Abers on top of a decision tree allows inspection and analysis of the model, to understand both the underlying relationship, and finding out in which parts of feature space that the model is accurate and/or confident.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/09/2021

Interpretable Machines: Constructing Valid Prediction Intervals with Random Forests

An important issue when using Machine Learning algorithms in recent rese...
research
08/11/2021

Trading Complexity for Sparsity in Random Forest Explanations

Random forests have long been considered as powerful model ensembles in ...
research
02/05/2022

Backtrack Tie-Breaking for Decision Trees: A Note on Deodata Predictors

A tie-breaking method is proposed for choosing the predicted class, or o...
research
11/06/2015

Finding structure in data using multivariate tree boosting

Technology and collaboration enable dramatic increases in the size of ps...
research
05/21/2022

Calibration of Natural Language Understanding Models with Venn–ABERS Predictors

Transformers, currently the state-of-the-art in natural language underst...
research
08/09/2022

Robust Scenario Interpretation from Multi-model Prediction Efforts

Multi-model prediction efforts in infectious disease modeling and climat...

Please sign up or login with your details

Forgot password? Click here to reset