On Efficient Range-Summability of IID Random Variables in Two or Higher Dimensions

10/14/2021
by   Jingfan Meng, et al.
0

d-dimensional efficient range-summability (dD-ERS) of a long list of random variables (RVs) is a fundamental algorithmic problem that has applications to two important families of database problems, namely, fast approximate wavelet tracking (FAWT) on data streams and approximately answering range-sum queries over a data cube. In this work, we propose a novel solution framework to dD-ERS for d>1 on RVs that have Gaussian or Poisson distribution. Our solutions are the first ones that compute any rectangular range-sum of the RVs in polylogarithmic time. Furthermore, we develop a novel k-wise independence theory that allows our dD-ERS solutions to have both high computational efficiencies and strong provable independence guarantees. Finally, we generalize existing DST-based solutions for 1D-ERS to 2D, and characterize a sufficient and likely necessary condition on the target distribution for this generalization to be feasible.

READ FULL TEXT

page 5

page 7

page 9

page 10

page 11

page 13

page 15

page 17

research
09/13/2021

A Dyadic Simulation Approach to Efficient Range-Summability

Efficient range-summability (ERS) of a long list of random variables is ...
research
04/17/2019

Remarks on the Rényi Entropy of a sum of IID random variables

In this note we study a conjecture of Madiman and Wang which predicted t...
research
11/28/2021

A Simple Necessary Condition For Independence of Real-Valued Random Variables

The standard method to check for the independence of two real-valued ran...
research
03/22/2023

Generalized Data Thinning Using Sufficient Statistics

Our goal is to develop a general strategy to decompose a random variable...
research
11/22/2021

A Dynamic Programming Algorithm to Compute Joint Distribution of Order Statistics on Graphs

Order statistics play a fundamental role in statistical procedures such ...
research
12/04/2017

Approximating the Sum of Independent Non-Identical Binomial Random Variables

The distribution of sum of independent non-identical binomial random var...
research
10/09/2017

Sum-Product Networks for Hybrid Domains

While all kinds of mixed data -from personal data, over panel and scient...

Please sign up or login with your details

Forgot password? Click here to reset