Concentration Bounds for the Collision Estimator

06/12/2020
by   Maciej Skorski, et al.
0

We prove a strong concentration result about the natural collision estimator, which counts the number of collisions that occur within an iid sample. This estimator is at the heart of algorithms used for uniformity testing and entropy assessment. While the prior works were limited to only variance, we use elegant techniques of independent interest to bounds higher moments and conclude concentration properties. As an immediate corollary we show that the estimator achieves high-probability guarantee on its own and there is no need for boosting (aka median/majority trick).

READ FULL TEXT
research
06/12/2020

Stochastic Analysis of Collision Estimator

We prove a strong concentration result about the collision estimator, wh...
research
04/20/2020

Estimating Ising Models from One Sample

Given one sample X ∈{± 1}^n from an Ising model [X=x]∝(x^ J x/2), whose ...
research
11/04/2021

A Concentration Bound for LSPE(λ)

The popular LSPE(λ) algorithm for policy evaluation is revisited to deri...
research
03/30/2023

The Graphical Nadaraya-Watson Estimator on Latent Position Models

Given a graph with a subset of labeled nodes, we are interested in the q...
research
04/02/2020

No Repetition: Fast Streaming with Highly Concentrated Hashing

To get estimators that work within a certain error bound with high proba...
research
06/09/2020

How Robust is the Median-of-Means? Concentration Bounds in Presence of Outliers

In contrast to the empirical mean, the Median-of-Means (MoM) is an estim...
research
02/03/2021

CountSketches, Feature Hashing and the Median of Three

In this paper, we revisit the classic CountSketch method, which is a spa...

Please sign up or login with your details

Forgot password? Click here to reset