Neyman-Pearson Classification under High-Dimensional Settings

08/13/2015
by   Anqi Zhao, et al.
0

Most existing binary classification methods target on the optimization of the overall classification risk and may fail to serve some real-world applications such as cancer diagnosis, where users are more concerned with the risk of misclassifying one specific class than the other. Neyman-Pearson (NP) paradigm was introduced in this context as a novel statistical framework for handling asymmetric type I/II error priorities. It seeks classifiers with a minimal type II error and a constrained type I error under a user specified level. This article is the first attempt to construct classifiers with guaranteed theoretical performance under the NP paradigm in high-dimensional settings. Based on the fundamental Neyman-Pearson Lemma, we used a plug-in approach to construct NP-type classifiers for Naive Bayes models. The proposed classifiers satisfy the NP oracle inequalities, which are natural NP paradigm counterparts of the oracle inequalities in classical binary classification. Besides their desirable theoretical properties, we also demonstrated their numerical advantages in prioritized error control via both simulation and real data studies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/07/2018

Sparse Linear Discriminant Analysis under the Neyman-Pearson Paradigm

In contrast to the classical binary classification paradigm that minimiz...
research
02/28/2011

Neyman-Pearson classification, convexity and stochastic constraints

Motivated by problems of anomaly detection, this paper implements the Ne...
research
11/08/2021

Neyman-Pearson Multi-class Classification via Cost-sensitive Learning

Most existing classification methods aim to minimize the overall misclas...
research
05/16/2007

Lasso type classifiers with a reject option

We consider the problem of binary classification where one can, for a pa...
research
03/12/2019

Neyman-Pearson Criterion (NPC): A Model Selection Criterion for Asymmetric Binary Classification

We propose a new model selection criterion, the Neyman-Pearson criterion...
research
06/24/2023

Robust Classification of High-Dimensional Data using Data-Adaptive Energy Distance

Classification of high-dimensional low sample size (HDLSS) data poses a ...
research
02/07/2018

Intentional control of type I error over unconscious data distortion: a Neyman-Pearson classification approach

The rise of social media enables millions of citizens to generate inform...

Please sign up or login with your details

Forgot password? Click here to reset