Sparse Approximation of a Kernel Mean

03/01/2015
by E. Cruz Cortés, et al.

Kernel means are frequently used to represent probability distributions in machine learning problems. In particular, the well-known kernel density estimator and the kernel mean embedding both have the form of a kernel mean. Unfortunately, kernel means scale poorly: a single point evaluation of the kernel density estimator, for example, requires computation time linear in the training sample size. To address this challenge, we present a method to efficiently construct a sparse approximation of a kernel mean. We do so by first establishing an incoherence-based bound on the approximation error, and then observing that, for radial kernels, the bound can be minimized by solving the k-center problem. The outcome is a linear-time construction of a sparse kernel mean, which also lends itself naturally to an automatic sparsity selection scheme. We demonstrate the computational gains of our method on three problems involving kernel means: Euclidean embedding of distributions, class proportion estimation, and clustering using the mean-shift algorithm.
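To make the idea concrete, the following is a minimal sketch of a k-center-based sparse kernel mean, not the authors' implementation. It assumes a Gaussian kernel as the radial kernel, selects centers with the standard greedy farthest-point heuristic for the k-center problem, and obtains the weights by a least-squares projection of the full kernel mean onto the span of the selected kernel functions. The weight step, the small ridge term, and all function names (`gaussian_kernel`, `greedy_k_center`, `sparse_kernel_mean`, `evaluate`) are assumptions made for illustration; the abstract does not specify them.

```python
import numpy as np

def gaussian_kernel(X, Y, bandwidth=1.0):
    # Pairwise Gaussian (radial) kernel matrix between rows of X and rows of Y.
    sq = (np.sum(X**2, axis=1)[:, None]
          + np.sum(Y**2, axis=1)[None, :]
          - 2.0 * X @ Y.T)
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * bandwidth**2))

def greedy_k_center(X, k, first=0):
    # Farthest-point heuristic for k-center: repeatedly add the point that is
    # farthest from the centers chosen so far. Cost is O(n * k) distances.
    centers = [first]
    dist = np.linalg.norm(X - X[first], axis=1)
    for _ in range(k - 1):
        nxt = int(np.argmax(dist))
        centers.append(nxt)
        dist = np.minimum(dist, np.linalg.norm(X - X[nxt], axis=1))
    return np.array(centers)

def sparse_kernel_mean(X, k, bandwidth=1.0, ridge=1e-8):
    # Assumed weight scheme: project the full kernel mean onto the span of the
    # k selected kernel functions by solving a small k-by-k linear system.
    idx = greedy_k_center(X, k)
    C = X[idx]
    K_cc = gaussian_kernel(C, C, bandwidth)
    b = gaussian_kernel(C, X, bandwidth).mean(axis=1)
    alpha = np.linalg.solve(K_cc + ridge * np.eye(k), b)  # ridge for stability
    return C, alpha

def evaluate(C, alpha, query, bandwidth=1.0):
    # Evaluating the sparse kernel mean costs O(k) kernel calls per query point,
    # instead of O(n) for the full kernel mean.
    return gaussian_kernel(query, C, bandwidth) @ alpha
```

With `C, alpha = sparse_kernel_mean(X, k)`, calling `evaluate(C, alpha, query)` approximates the full kernel mean `gaussian_kernel(query, X).mean(axis=1)` using only `k` kernel evaluations per query point. The sparsity level `k` is chosen by hand in this sketch; the automatic selection scheme mentioned in the abstract is not reproduced here.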


research
07/10/2021

Kernel Mean Estimation by Marginalized Corrupted Distributions

Estimating the kernel mean in a reproducing kernel Hilbert space is a cr...
research
03/28/2014

Characteristic Kernels and Infinitely Divisible Distributions

We connect shift-invariant characteristic kernels to infinitely divisibl...
research
12/23/2019

Kernel Embedding Linear Response

In the paper, we study the problem of estimating linear response statist...
research
06/04/2015

An Average Classification Algorithm

Many classification algorithms produce a classifier that is a weighted a...
research
02/13/2018

MONK -- Outlier-Robust Mean Embedding Estimation by Median-of-Means

Mean embeddings provide an extremely flexible and powerful tool in machi...
research
01/31/2022

Nyström Kernel Mean Embeddings

Kernel mean embeddings are a powerful tool to represent probability dist...
research
12/16/2019

Finding the Mode of a Kernel Density Estimate

Given points p_1, ..., p_n in R^d, how do we find a point x which maximi...
