Calibrating Black Box Classification Models through the Thresholding Method

05/20/2017
by Arun Srinivasan, et al.

In high-dimensional classification settings, we seek a balance between high power and control over a desired loss function. In many settings, the points most likely to be misclassified are those that lie near the decision boundary of the given classification method. Often, these uninformative points should not be classified, as they are noisy and do not exhibit strong signals. In this paper, we introduce the Thresholding Method to parameterize the problem of determining which points exhibit strong signals and should be classified. We demonstrate the empirical performance of this novel calibration method in controlling the loss function at a desired level, and explore how the method mitigates the effect of overfitting. We explore the benefits of error control through the Thresholding Method in difficult, high-dimensional, simulated settings. Finally, we show the flexibility of the Thresholding Method by applying it in a variety of real data settings.
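The abstract does not spell out the procedure, so the following is only a minimal sketch of the general idea of abstaining near the decision boundary, not the authors' algorithm. It assumes a black-box classifier that returns a signed score with the decision boundary at 0, a held-out calibration set with known 0/1 labels, and misclassification rate as the loss. The names `choose_threshold`, `classify_with_abstention`, and `ABSTAIN` are hypothetical.

```python
from typing import List, Optional, Sequence

ABSTAIN = -1  # hypothetical marker for "not classified"


def choose_threshold(scores: Sequence[float],
                     labels: Sequence[int],
                     alpha: float) -> Optional[float]:
    """Smallest margin threshold t such that, among calibration points
    with |score| >= t, the empirical misclassification rate is <= alpha.
    Returns None if no candidate threshold meets the target."""
    margins = [abs(s) for s in scores]
    # A point is predicted as class 1 when its score is non-negative.
    errors = [(1 if s >= 0 else 0) != y for s, y in zip(scores, labels)]
    # Candidate thresholds are the observed margins, smallest first,
    # so the first one that meets the target classifies the most points.
    for t in sorted(set(margins)):
        kept = [e for m, e in zip(margins, errors) if m >= t]
        if sum(kept) / len(kept) <= alpha:
            return t
    return None


def classify_with_abstention(scores: Sequence[float], t: float) -> List[int]:
    """Predict 0/1 for points with |score| >= t and ABSTAIN otherwise."""
    return [ABSTAIN if abs(s) < t else (1 if s >= 0 else 0) for s in scores]


# Toy calibration data (made up for illustration only).
cal_scores = [2.0, 1.5, 0.5, 0.25, -0.25, -0.5, -1.5, -2.0]
cal_labels = [1, 1, 0, 1, 1, 0, 0, 0]

t = choose_threshold(cal_scores, cal_labels, alpha=0.2)
new_predictions = classify_with_abstention([2.0, 0.3, -0.3, -1.0], t)
```

Under these assumptions, a smaller `alpha` pushes the threshold outward, so fewer points are classified. Meeting the target on the calibration set is an empirical statement only; it does not by itself guarantee the same error level on new data.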

