Interpretable Meta-Measure for Model Performance

06/02/2020
by   Alicja Gosiewska, et al.
0

Measures for evaluation of model performance play an important role in Machine Learning. However, the most common performance measures share several limitations. The difference in performance for two models has no probabilistic interpretation and there is no reference point to indicate whether they represent a significant improvement. What is more, it makes no sense to compare such differences between data sets. In this article, we introduce a new meta-measure for performance assessment named Elo-based Predictive Power (EPP). The differences in EPP scores have probabilistic interpretation and can be directly compared between data sets. We prove the mathematical properties of EPP and support them with empirical results of a large scale benchmark on 30 classification data sets. Finally, we show applications of EPP to the selected meta-learning problems and challenges beyond ML benchmarks.

READ FULL TEXT
research
08/24/2019

EPP: interpretable score of model predictive power

The most important part of model selection and hyperparameter tuning is ...
research
08/21/2023

Foundation Model-oriented Robustness: Robust Image Model Evaluation with Pretrained Models

Machine learning has demonstrated remarkable performance over finite dat...
research
06/28/2021

Explaining the Performance of Multi-label Classification Methods with Data Set Properties

Meta learning generalizes the empirical experience with different learni...
research
01/03/2018

An Analysis of Two Common Reference Points for EEGs

Clinical electroencephalographic (EEG) data varies significantly dependi...
research
10/18/2021

Learning Prototype-oriented Set Representations for Meta-Learning

Learning from set-structured data is a fundamental problem that has rece...
research
09/25/2020

Adjusted Measures for Feature Selection Stability for Data Sets with Similar Features

For data sets with similar features, for example highly correlated featu...
research
02/08/2022

The Lifecycle of a Statistical Model: Model Failure Detection, Identification, and Refitting

The statistical machine learning community has demonstrated considerable...

Please sign up or login with your details

Forgot password? Click here to reset