Fast and stable multivariate kernel density estimation by fast sum updating

12/04/2017
by   Nicolas Langrené, et al.
0

Kernel density estimation and kernel regression are powerful but computationally expensive techniques: a direct evaluation of kernel density estimates at M evaluation points given N input sample points requires a quadratic O(MN) operations, which is prohibitive for large scale problems. For this reason, approximate methods such as binning with Fast Fourier Transform or the Fast Gauss Transform have been proposed to speed up kernel density estimation. Among these fast methods, the Fast Sum Updating approach is an attractive alternative, as it is an exact method and its speed is independent from the input sample and the bandwidth. Unfortunately, this method, based on data sorting, has for the most part been limited to the univariate case. In this paper, we revisit the fast sum updating approach and extend it in several ways. Our main contribution is to extend it to the general multivariate case for general input data and rectilinear evaluation grid. Other contributions include its extension to a wider class of kernels, including the triangular, cosine and Silverman kernels, and its combination with a fast approximate k-nearest-neighbors bandwidth for multivariate datasets. Our numerical tests of multivariate regression and density estimation confirm the speed, accuracy and stability of the method. We hope this paper will renew interest for the fast sum updating approach and help solve large scale practical density estimation and regression problems.

READ FULL TEXT

page 9

page 37

research
05/07/2020

Fast multivariate empirical cumulative distribution function with connection to kernel density estimation

This paper revisits the problem of computing empirical cumulative distri...
research
06/02/2018

Fast Exact Univariate Kernel Density Estimation

This paper presents new methodology for computationally efficient kernel...
research
04/16/2022

FKreg: A MATLAB toolbox for fast Multivariate Kernel Regression

Kernel smooth is the most fundamental technique for data density and reg...
research
04/18/2015

Fast optimization of Multithreshold Entropy Linear Classifier

Multithreshold Entropy Linear Classifier (MELC) is a density based model...
research
06/08/2016

Fast and Extensible Online Multivariate Kernel Density Estimation

We present xokde++, a state-of-the-art online kernel density estimation ...
research
02/14/2011

Dual-Tree Fast Gauss Transforms

Kernel density estimation (KDE) is a popular statistical technique for e...
research
09/03/2018

Sea Clutter Distribution Modeling: A Kernel Density Estimation Approach

An accurate sea clutter distribution is crucial for decision region dete...

Please sign up or login with your details

Forgot password? Click here to reset