Metric Learning Improves the Ability of Combinatorial Coverage Metrics to Anticipate Classification Error

02/28/2023
by Tyler Cody, et al.

Machine learning models are increasingly used in practice. However, many machine learning methods are sensitive to test or operational data that is dissimilar to training data. Out-of-distribution (OOD) data is known to increase the probability of error, and research into metrics that identify which dissimilarities in data affect model performance is ongoing. Recently, combinatorial coverage metrics have been explored in the literature as an alternative to distribution-based metrics. Some results show that coverage metrics can correlate with classification error, while others show that their utility is highly dataset-dependent. In this paper, we show that this dataset dependence can be alleviated with metric learning, a machine learning technique for learning latent spaces in which data from different classes are farther apart. In a study of 6 open-source datasets, we find that metric learning increased the difference between set-difference combinatorial coverage metrics (SDCCMs) calculated on correctly and incorrectly classified data, thereby demonstrating that metric learning improves the ability of SDCCMs to anticipate classification error. Paired t-tests validate the statistical significance of our findings. Overall, we conclude that metric learning improves the ability of coverage metrics to anticipate classifier error and to identify when OOD data is likely to degrade model performance.
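To make the set-difference coverage idea concrete, the following is a minimal sketch rather than the authors' implementation. It assumes the formulation commonly used in the combinatorial coverage literature: the fraction of t-way feature-value interactions that appear in a target set (for example, test data) but not in a source set (for example, training data). The function names `interactions` and `set_difference_coverage` and the toy data are hypothetical, and the sketch assumes features are already discrete; any metric-learning or discretization step is outside its scope.

```python
from itertools import combinations


def interactions(rows, t):
    """Collect every t-way interaction that appears in `rows`.

    An interaction is a tuple of (column_index, value) pairs,
    one pair for each of the t chosen columns.
    """
    found = set()
    for row in rows:
        for cols in combinations(range(len(row)), t):
            found.add(tuple((c, row[c]) for c in cols))
    return found


def set_difference_coverage(target, source, t):
    """Fraction of the target's t-way interactions that are absent from the source.

    Assumed formulation: |T_t \\ S_t| / |T_t|, where T_t and S_t are the
    sets of t-way interactions in the target and source data.
    Returns 0.0 when the target contains no interactions.
    """
    target_set = interactions(target, t)
    if not target_set:
        return 0.0
    source_set = interactions(source, t)
    return len(target_set - source_set) / len(target_set)


# Hypothetical toy data: three binary features per sample.
train = [(0, 0, 1), (0, 1, 0), (1, 0, 0)]
test = [(0, 0, 1), (1, 1, 1)]

score = set_difference_coverage(test, train, t=2)
print(score)  # 0.5: half of the test set's 2-way interactions are unseen in training
```

Under this reading, a higher score means the target data contains more feature-value combinations that the source data never exhibited, which is the kind of signal the abstract relates to classification error.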

