Explainable Online Validation of Machine Learning Models for Practical Applications

by   Wolfgang Fuhl, et al.

We present a reformulation of the regression and classification, which aims to validate the result of a machine learning algorithm. Our reformulation simplifies the original problem and validates the result of the machine learning algorithm using the training data. Since the validation of machine learning algorithms must always be explainable, we perform our experiments with the kNN algorithm as well as with an algorithm based on conditional probabilities, which is proposed in this work. For the evaluation of our approach, three publicly available data sets were used and three classification and two regression problems were evaluated. The presented algorithm based on conditional probabilities is also online capable and requires only a fraction of memory compared to the kNN algorithm.


Cyclic Boosting – an explainable supervised machine learning algorithm

Supervised machine learning algorithms have seen spectacular advances an...

Modeling Generalization in Machine Learning: A Methodological and Computational Study

As machine learning becomes more and more available to the general publi...

The Stochastic Replica Approach to Machine Learning: Stability and Parameter Optimization

We introduce a statistical physics inspired supervised machine learning ...

Sample selection from a given dataset to validate machine learning models

The selection of a validation basis from a full dataset is often require...

Rapid Learning with Stochastic Focus of Attention

We present a method to stop the evaluation of a decision making process ...

Fairness Measures for Regression via Probabilistic Classification

Algorithmic fairness involves expressing notions such as equity, or reas...

Fast, accurate, and transferable many-body interatomic potentials by symbolic regression

The length and time scales of atomistic simulations are limited by the c...