Facility Locations Utility for Uncovering Classifier Overconfidence

10/12/2018
by   Karsten Maurer, et al.
0

Assessing the predictive accuracy of black box classifiers is challenging in the absence of labeled test datasets. In these scenarios we may need to rely on a human oracle to evaluate individual predictions; presenting the challenge to create query algorithms to guide the search for points that provide the most information about the classifier's predictive characteristics. Previous works have focused on developing utility models and query algorithms for discovering unknown unknowns --- misclassifications with a predictive confidence above some arbitrary threshold. However, if misclassifications occur at the rate reflected by the confidence values, then these search methods reveal nothing more than a proper assessment of predictive certainty. We are unable to properly mitigate the risks associated with model deficiency when the model's confidence in prediction exceeds the actual model accuracy. We propose a facility locations utility model and corresponding greedy query algorithm that instead searches for overconfident unknown unknowns. Through robust empirical experiments we demonstrate that the greedy query algorithm with the facility locations utility model consistently results in oracle queries with superior performance in discovering overconfident unknown unknowns than previous methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/28/2016

Identifying Unknown Unknowns in the Open World: Representations and Policies for Guided Exploration

Predictive models deployed in the real world may assign incorrect labels...
research
06/29/2020

Harnessing Adversarial Distances to Discover High-Confidence Errors

Given a deep neural network image classification model that we treat as ...
research
11/19/2019

Sequential Mode Estimation with Oracle Queries

We consider the problem of adaptively PAC-learning a probability distrib...
research
08/17/2022

An Evolutionary, Gradient-Free, Query-Efficient, Black-Box Algorithm for Generating Adversarial Instances in Deep Networks

Deep neural networks (DNNs) are sensitive to adversarial data in a varie...
research
12/23/2017

Query-limited Black-box Attacks to Classifiers

We study black-box attacks on machine learning classifiers where each qu...
research
02/06/2023

Variational Information Pursuit for Interpretable Predictions

There is a growing interest in the machine learning community in develop...
research
06/11/2018

Learning to Speed Up Structured Output Prediction

Predicting structured outputs can be computationally onerous due to the ...

Please sign up or login with your details

Forgot password? Click here to reset