Dynamic Maintenance of Kernel Density Estimation Data Structure: From Practice to Theory

08/08/2022
by   Jiehao Liang, et al.
0

Kernel density estimation (KDE) stands out as a challenging task in machine learning. The problem is defined in the following way: given a kernel function f(x,y) and a set of points {x_1, x_2, ⋯, x_n }⊂ℝ^d, we would like to compute 1/n∑_i=1^n f(x_i,y) for any query point y ∈ℝ^d. Recently, there has been a growing trend of using data structures for efficient KDE. However, the proposed KDE data structures focus on static settings. The robustness of KDE data structures over dynamic changing data distributions is not addressed. In this work, we focus on the dynamic maintenance of KDE data structures with robustness to adversarial queries. Especially, we provide a theoretical framework of KDE data structures. In our framework, the KDE data structures only require subquadratic spaces. Moreover, our data structure supports the dynamic update of the dataset in sublinear time. Furthermore, we can perform adaptive queries with the potential adversary in sublinear time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/30/2018

Hashing-Based-Estimators for Kernel Density in High Dimensions

Given a set of points P⊂R^d and a kernel k, the Kernel Density Estimate ...
research
06/20/2023

Data Structures for Density Estimation

We study statistical/computational tradeoffs for the following density e...
research
02/24/2022

A Dynamic Fast Gaussian Transform

The Fast Gaussian Transform (FGT) enables subquadratic-time multiplicati...
research
10/20/2022

Sketching Meets Differential Privacy: Fast Algorithm for Dynamic Kronecker Projection Maintenance

Projection maintenance is one of the core data structure tasks. Efficien...
research
05/31/2018

Tokenized Data Markets

We formalize the construction of decentralized data markets by introduci...
research
04/30/2021

Speeding up Python-based Lagrangian Fluid-Flow Particle Simulations via Dynamic Collection Data Structures

Array-like collection data structures are widely established in Python's...
research
02/07/2018

Optimal data structures for stochastic driven simulations

Simulations where we have some prior information on the probability dist...

Please sign up or login with your details

Forgot password? Click here to reset