Data Structures for Density Estimation

06/20/2023
by   Anders Aamand, et al.
0

We study statistical/computational tradeoffs for the following density estimation problem: given k distributions v_1, …, v_k over a discrete domain of size n, and sampling access to a distribution p, identify v_i that is "close" to p. Our main result is the first data structure that, given a sublinear (in n) number of samples from p, identifies v_i in time sublinear in k. We also give an improved version of the algorithm of Acharya et al. (2018) that reports v_i in time linear in k. The experimental evaluation of the latter algorithm shows that it achieves a significant reduction in the number of operations needed to achieve a given accuracy compared to prior work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/08/2022

Dynamic Maintenance of Kernel Density Estimation Data Structure: From Practice to Theory

Kernel density estimation (KDE) stands out as a challenging task in mach...
research
11/09/2022

Sampling an Edge in Sublinear Time Exactly and Optimally

Sampling edges from a graph in sublinear time is a fundamental problem a...
research
02/14/2023

Private Statistical Estimation of Many Quantiles

This work studies the estimation of many statistical quantiles under dif...
research
04/08/2021

Linear and Sublinear Time Spectral Density Estimation

We analyze the popular kernel polynomial method (KPM) for approximating ...
research
02/24/2022

A Dynamic Fast Gaussian Transform

The Fast Gaussian Transform (FGT) enables subquadratic-time multiplicati...
research
06/30/2021

A Statistical Taylor Theorem and Extrapolation of Truncated Densities

We show a statistical version of Taylor's theorem and apply this result ...
research
12/01/2022

Sub-quadratic Algorithms for Kernel Matrices via Kernel Density Estimation

Kernel matrices, as well as weighted graphs represented by them, are ubi...

Please sign up or login with your details

Forgot password? Click here to reset