A Topological-Framework to Improve Analysis of Machine Learning Model Performance

07/09/2021
by Henry Kvinge, et al.

As both machine learning models and the datasets on which they are evaluated have grown in size and complexity, the practice of using a few summary statistics to understand model performance has become increasingly problematic. This is particularly true in real-world scenarios where understanding model failure on certain subpopulations of the data is of critical importance. In this paper we propose a topological framework for evaluating machine learning models in which a dataset is treated as a "space" on which a model operates. This provides us with a principled way to organize information about model performance at both the global level (over the entire test set) and the local level (on specific subpopulations). Finally, we describe a topological data structure, the presheaf, which offers a convenient way to store and analyze model performance across different subpopulations.
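To make the global/local distinction concrete, here is a minimal sketch in Python of a presheaf-like structure over subpopulations of a test set. It is an illustration under stated assumptions, not the construction from the paper: subpopulations are modeled as sets of example indices ordered by inclusion, a "section" over a subpopulation is a per-example correctness record, and the restriction map simply forgets the examples outside the smaller subpopulation. The names `restrict` and `accuracy` and the toy labels, predictions, and groups are all hypothetical.

```python
from typing import Dict, FrozenSet

# A "section" over a subpopulation: example index -> was the prediction correct?
Section = Dict[int, bool]


def restrict(section: Section, subpop: FrozenSet[int]) -> Section:
    """Restriction map (assumed form): keep only the records for `subpop`."""
    missing = [i for i in subpop if i not in section]
    if missing:
        raise ValueError(f"subpopulation is not contained in the section: {missing}")
    return {i: section[i] for i in subpop}


def accuracy(section: Section) -> float:
    """Summary statistic computed from a section."""
    if not section:
        raise ValueError("cannot compute accuracy over an empty subpopulation")
    return sum(section.values()) / len(section)


# Toy test set (hypothetical data): labels, model predictions, group membership.
labels = [0, 1, 1, 0, 1, 0]
preds = [0, 1, 0, 0, 0, 0]
groups = ["a", "a", "b", "b", "b", "a"]

# Global section: correctness recorded over the entire test set.
global_section: Section = {
    i: p == y for i, (p, y) in enumerate(zip(preds, labels))
}

# Subpopulations as index sets, one per group.
subpops = {
    g: frozenset(i for i, gi in enumerate(groups) if gi == g)
    for g in set(groups)
}

# Local sections are obtained by restricting the global one.
local_sections = {g: restrict(global_section, s) for g, s in subpops.items()}

print("global accuracy:", accuracy(global_section))
for g in sorted(local_sections):
    print(f"accuracy on group {g}:", accuracy(local_sections[g]))
```

In this toy example the global accuracy looks moderate while one group is served much worse than the other, which is the kind of failure a single summary statistic hides. Restricting in two steps (whole test set to a group, then to a subset of that group) gives the same record as restricting in one step, which is the compatibility condition a presheaf requires of its restriction maps.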

