An Upper Bound for the Distribution Overlap Index and Its Applications

12/16/2022
by   Hao Fu, et al.
0

This paper proposes an easy-to-compute upper bound for the overlap index between two probability distributions without requiring any knowledge of the distribution models. The computation of our bound is time-efficient and memory-efficient and only requires finite samples. The proposed bound shows its value in one-class classification and domain shift analysis. Specifically, in one-class classification, we build a novel one-class classifier by converting the bound into a confidence score function. Unlike most one-class classifiers, the training process is not needed for our classifier. Additionally, the experimental results show that our classifier can be accurate with only a small number of in-class samples and outperforms many state-of-the-art methods on various datasets in different one-class classification scenarios. In domain shift analysis, we propose a theorem based on our bound. The theorem is useful in detecting the existence of domain shift and inferring data information. The detection and inference processes are both computation-efficient and memory-efficient. Our work shows significant promise toward broadening the applications of overlap-based metrics.

READ FULL TEXT

page 7

page 10

research
04/27/2023

A Chain Rule for the Expected Suprema of Bernoulli Processes

We obtain an upper bound on the expected supremum of a Bernoulli process...
research
05/26/2022

Undersampling is a Minimax Optimal Robustness Intervention in Nonparametric Classification

While a broad range of techniques have been proposed to tackle distribut...
research
07/13/2023

A Novel Bayes' Theorem for Upper Probabilities

In their seminal 1990 paper, Wasserman and Kadane establish an upper bou...
research
01/25/2021

Approximating Probability Distributions by ReLU Networks

How many neurons are needed to approximate a target probability distribu...
research
05/11/2023

A tight upper bound on the number of non-zero weights of a constacyclic code

For a simple-root λ-constacyclic code 𝒞 over 𝔽_q, let ⟨ρ⟩ and ⟨ρ,M⟩ be t...
research
06/29/2016

Tighter bounds lead to improved classifiers

The standard approach to supervised classification involves the minimiza...
research
08/07/2015

Mismatch in the Classification of Linear Subspaces: Sufficient Conditions for Reliable Classification

This paper considers the classification of linear subspaces with mismatc...

Please sign up or login with your details

Forgot password? Click here to reset