Novel Bernstein-like Concentration Inequalities for the Missing Mass

We are concerned with obtaining novel concentration inequalities for the missing mass, i.e. the total probability mass of the outcomes not observed in the sample. We not only derive - for the first time - distribution-free Bernstein-like deviation bounds with sublinear exponents in deviation size for missing mass, but also improve the results of McAllester and Ortiz (2003) andBerend and Kontorovich (2013, 2012) for small deviations which is the most interesting case in learning theory. It is known that the majority of standard inequalities cannot be directly used to analyze heterogeneous sums i.e. sums whose terms have large difference in magnitude. Our generic and intuitive approach shows that the heterogeneity issue introduced in McAllester and Ortiz (2003) is resolvable at least in the case of missing mass via regulating the terms using our novel thresholding technique.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/20/2015

A Bennett Inequality for the Missing Mass

Novel concentration inequalities are obtained for the missing mass, i.e....
research
02/25/2014

Novel Deviation Bounds for Mixture of Independent Bernoulli Variables with Application to the Missing Mass

In this paper, we are concerned with obtaining distribution-free concent...
research
05/19/2020

Revisiting Concentration of Missing Mass

We revisit the problem of missing mass concentration, deriving Bernstein...
research
10/05/2021

Estimation and Concentration of Missing Mass of Functions of Discrete Probability Distributions

Given a positive function g from [0,1] to the reals, the function's miss...
research
02/27/2019

Consistent estimation of the missing mass for feature models

Feature models are popular in machine learning and they have been recent...
research
03/12/2015

On the Impossibility of Learning the Missing Mass

This paper shows that one cannot learn the probability of rare events wi...
research
07/19/2018

Sparse space-time models: Concentration Inequalities and Lasso

Inspired by Kalikow-type decompositions, we introduce a new stochastic m...

Please sign up or login with your details

Forgot password? Click here to reset