Exact confidence interval for generalized Flajolet-Martin algorithms

09/25/2019
by   Giacomo Aletti, et al.
0

This paper develop a deep mathematical-statistical approach to analyze a class of Flajolet-Martin algorithms (FMa), and provide a exact analytical confidence interval for the number F_0 of distinct elements in a stream, based on Chernoff bounds. The class of FMa has reached a significant popularity in bigdata stream learning, and the attention of the literature has mainly been based on algorithmic aspects, basically complexity optimality, while the statistical analysis of these class of algorithms has been often faced heuristically. The analysis provided here shows a deep connections with special mathematical functions and with extreme value theory. The latter connection may help in explaining heuristic considerations, while the first opens many numerical issues, faced at the end of the present paper. Finally, MonteCarlo simulations are provided to support our analytical choice in this context.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/26/2021

A Unified Approach for Constructing Confidence Intervals and Hypothesis Tests Using h-function

We introduce a general method, named the h-function method, to unify the...
research
11/25/2021

Exact Confidence Bounds in Discrete Models – Algorithmic Aspects of Sterne's Method

In this manuscript we review two methods to construct exact confidence b...
research
04/10/2021

Exact-corrected confidence interval for risk difference in noninferiority binomial trials

A novel confidence interval estimator is proposed for the risk differenc...
research
06/08/2023

Analysis of Knuth's Sampling Algorithm D and D'

In this research paper, we address the Distinct Elements estimation prob...
research
12/03/2019

Mathematical aspects relative to the fluid statics of a self-gravitating perfect-gas isothermal sphere

In the present paper we analyze and discuss some mathematical aspects of...
research
02/14/2023

A Framework for Mediation Analysis with Massive Data

During the past few years, mediation analysis has gained increasing popu...
research
11/15/2021

Orthounimodal Distributionally Robust Optimization: Representation, Computation and Multivariate Extreme Event Applications

This paper studies a basic notion of distributional shape known as ortho...

Please sign up or login with your details

Forgot password? Click here to reset