Kernel Density Estimation Bias under Minimal Assumptions

01/02/2019
by   Maciej Skorski, et al.

Kernel density estimation is a very popular technique for approximating a density function from samples. Its accuracy is generally well understood and depends, roughly speaking, on the kernel decay and the local smoothness of the true density. However, concrete statements in the literature are often given only in very specific settings (with simplified or overly conservative assumptions) or miss important but subtle points (e.g. it is common to heuristically apply Taylor's expansion globally without referring to compactness). The contribution of this paper is twofold: (a) we demonstrate that, when the bandwidth is an arbitrary invertible matrix going to zero, it is necessary to keep a certain balance between the kernel decay and the magnitudes of the bandwidth eigenvalues; in fact, without sufficient decay the estimates may not even be bounded; (b) we give a rigorous derivation of bounds with explicit constants for the bias, under possibly minimal assumptions. This connects the kernel decay, bandwidth norm, bandwidth determinant, and density smoothness. It has been folklore that the issue with Taylor's formula can be fixed with more complicated assumptions on the density (for example, p. 95 of "Kernel Smoothing" by Wand and Jones); we show that this is actually not necessary and that the issue can be handled by the kernel decay alone.
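As a rough numerical illustration of the quantities the abstract refers to (not taken from the paper), the sketch below compares the exact bias of a kernel density estimate with its second-order Taylor approximation when the bandwidth is an invertible matrix shrinking to zero. Everything specific here is an assumption chosen so that the bias has a closed form: the true density is a standard normal N(0, I), the kernel is Gaussian, and the kernel is scaled as K_H(u) = |det H|^{-1} K(H^{-1} u). The function names and the matrix `H0` are hypothetical.

```python
import numpy as np

def gaussian_pdf(x, cov):
    """Density of N(0, cov) at the point x (x: shape (d,), cov: shape (d, d))."""
    d = x.shape[0]
    quad = x @ np.linalg.solve(cov, x)
    norm = np.sqrt((2 * np.pi) ** d * np.linalg.det(cov))
    return np.exp(-0.5 * quad) / norm

def exact_bias(x, H):
    """Exact bias E[f_hat(x)] - f(x), assuming f = N(0, I) and a Gaussian kernel.

    Smoothing N(0, I) with K_H gives N(0, I + H H^T), so the expected
    value of the estimator is available in closed form.
    """
    d = x.shape[0]
    return gaussian_pdf(x, np.eye(d) + H @ H.T) - gaussian_pdf(x, np.eye(d))

def taylor_bias(x, H):
    """Second-order Taylor approximation: 0.5 * trace(H H^T Hessian f(x))."""
    d = x.shape[0]
    # Hessian of the standard normal density is f(x) * (x x^T - I).
    hessian = (np.outer(x, x) - np.eye(d)) * gaussian_pdf(x, np.eye(d))
    return 0.5 * np.trace(H @ H.T @ hessian)

def bias_ratio(x, H):
    """Ratio of exact bias to its Taylor approximation."""
    return exact_bias(x, H) / taylor_bias(x, H)

if __name__ == "__main__":
    x = np.array([0.5, -0.3])
    # Hypothetical non-symmetric, invertible bandwidth shape; t scales it to zero.
    H0 = np.array([[1.0, 0.5], [0.0, 2.0]])
    for t in [0.4, 0.2, 0.1, 0.05, 0.01]:
        print(f"t={t:<5} exact/taylor = {bias_ratio(x, t * H0):.4f}")
```

In this benign setting (Gaussian kernel, smooth density with bounded derivatives) the ratio should approach 1 as the bandwidth shrinks. The example does not reproduce the paper's bounds or constants, and it says nothing about kernels with slow decay, which is where the abstract indicates the difficulties arise.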


research
05/24/2017

Consistent Kernel Density Estimation with Non-Vanishing Bandwidth

Consistency of the kernel density estimator requires that the kernel ban...
research
03/25/2019

β-Divergence loss for the kernel density estimation with bias reduced

Although nonparametric kernel density estimation with bias reduce is no...
research
04/26/2021

Data-Based Optimal Bandwidth for Kernel Density Estimation of Statistical Samples

It is a common practice to evaluate probability density function or matt...
research
07/03/2023

A maximal inequality for local empirical processes under weak dependence

We introduce a maximal inequality for a local empirical process under st...
research
10/25/2018

Adaptive Density Estimation on Bounded Domains

We study the estimation, in Lp-norm, of density functions defined on [0,...
research
02/14/2011

Dual-Tree Fast Gauss Transforms

Kernel density estimation (KDE) is a popular statistical technique for e...
research
06/27/2012

Faster Gaussian Summation: Theory and Experiment

We provide faster algorithms for the problem of Gaussian summation, whic...
