Asymptotics for Outlier Hypothesis Testing

01/23/2022
by   Lin Zhou, et al.
0

We revisit the outlier hypothesis testing framework of Li et al. (TIT 2014) and derive fundamental limits for the optimal test. In outlier hypothesis testing, one is given multiple observed sequences, where most sequences are generated i.i.d. from a nominal distribution. The task is to discern the set of outlying sequences that are generated according to anomalous distributions. The nominal and anomalous distributions are unknown. We consider the case of multiple outliers where the number of outliers is unknown and each outlier can follow a different anomalous distribution. Under this setting, we study the tradeoff among the probabilities of misclassification error, false alarm and false reject. Specifically, we propose a threshold-based test that ensures exponential decay of misclassification error and false alarm probabilities. We study two constraints on the false reject probability, with one constraint being that it is a non-vanishing constant and the other being that it has an exponential decay rate. For both cases, we characterize bounds on the false reject probability, as a function of the threshold, for each tuple of nominal and anomalous distributions. Finally, we demonstrate the asymptotic optimality of our test under the generalized Neyman-Pearson criterion.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/08/2020

Second-Order Asymptotically Optimal Universal Outlying Sequence Detection with Reject Option

Motivated by practical machine learning applications, we revisit the out...
research
06/09/2021

Statistical Classification via Robust Hypothesis Testing

In this letter, we consider multiple statistical classification problem ...
research
09/27/2022

Hypothesis Testing for Detecting Outlier Evaluators

In epidemiological studies, very often, evaluators obtain measurements o...
research
08/25/2021

Testing for directed information graphs

In this paper, we study a hypothesis test to determine the underlying di...
research
12/15/2020

Optimal ROC Curves from Score Variable Threshold Tests

The Receiver Operating Characteristic (ROC) is a well-established repres...
research
07/22/2022

Statistical Hypothesis Testing Based on Machine Learning: Large Deviations Analysis

We study the performance – and specifically the rate at which the error ...
research
12/07/2022

Criterion for the resemblance between the mother and the model distribution

If the probability distribution model aims to approximate the hidden mot...

Please sign up or login with your details

Forgot password? Click here to reset