Second-Order Asymptotically Optimal Statistical Classification

06/03/2018
by   Lin Zhou, et al.
0

Motivated by real-world machine learning applications, we analyze approximations to the non-asymptotic fundamental limits of statistical classification. In the binary version of this problem, given two training sequences generated according to two unknown distributions P_1 and P_2, one is tasked to classify a test sequence which is known to be generated according to either P_1 or P_2. This problem can be thought of as an analogue of the binary hypothesis testing problem but in the present setting, the generating distributions are unknown. Due to finite sample considerations, we consider the second-order asymptotics (or dispersion-type) tradeoff between type-I and type-II error probabilities for tests which ensure that (i) the type-I error probability for all pairs of distributions decays exponentially fast and (ii) the type-II error probability for a particular pair of distributions is non-vanishing. We generalize our results to classification of multiple hypotheses with the rejection option.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/08/2020

Second-Order Asymptotically Optimal Universal Outlying Sequence Detection with Reject Option

Motivated by practical machine learning applications, we revisit the out...
research
12/03/2019

Sequential Classification with Empirically Observed Statistics

Motivated by real-world machine learning applications, we consider a sta...
research
10/08/2019

Evaluation of Error Probability of Classification Based on the Analysis of the Bayes Code

Suppose that we have two training sequences generated by parametrized di...
research
07/22/2022

Statistical Hypothesis Testing Based on Machine Learning: Large Deviations Analysis

We study the performance – and specifically the rate at which the error ...
research
01/16/2023

Large Deviations for Classification Performance Analysis of Machine Learning Systems

We study the performance of machine learning binary classification techn...
research
02/28/2011

Neyman-Pearson classification, convexity and stochastic constraints

Motivated by problems of anomaly detection, this paper implements the Ne...
research
03/14/2019

Distributed Detection with Empirically Observed Statistics

We consider a binary distributed detection problem in which the distribu...

Please sign up or login with your details

Forgot password? Click here to reset