Statistical Optimality and Computational Efficiency of Nyström Kernel PCA

05/19/2021
by Nicholas Sterge, et al.

Kernel methods provide an elegant framework for developing nonlinear learning algorithms from simple linear methods. Though these methods show superior empirical performance in several real-data applications, their usefulness is limited by the significant computational burden they incur when the sample size is large. Various approximation schemes have been proposed in the literature to alleviate these computational issues, and the resulting approximate kernel machines have been shown to retain the empirical performance. However, the theoretical properties of these approximate kernel machines are less well understood. In this work, we theoretically study the trade-off between computational complexity and statistical accuracy in Nyström approximate kernel principal component analysis (KPCA), and show that Nyström approximate KPCA matches the statistical performance of (non-approximate) KPCA while remaining computationally beneficial. Additionally, we show that Nyström approximate KPCA has better statistical behavior than another popular approximation scheme, the random feature approximation, when applied to KPCA.
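To make the idea of Nyström approximate KPCA concrete, here is a minimal NumPy sketch. It is not the authors' estimator as stated in the full text, only one common formulation: sample m landmark points, build Nyström features from the n x m and m x m kernel blocks, and run ordinary PCA on those features. The Gaussian kernel, uniform landmark sampling, and the names `gaussian_kernel`, `nystrom_kpca`, `bandwidth`, and `eps` are all assumptions made for illustration.

```python
import numpy as np


def gaussian_kernel(A, B, bandwidth=1.0):
    """Gaussian kernel matrix between rows of A and rows of B (assumed kernel choice)."""
    sq = (A ** 2).sum(axis=1)[:, None] + (B ** 2).sum(axis=1)[None, :] - 2.0 * A @ B.T
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * bandwidth ** 2))


def nystrom_kpca(X, m, n_components, bandwidth=1.0, seed=0, eps=1e-10):
    """Sketch of KPCA restricted to the span of m uniformly sampled landmarks.

    Returns the top eigenvalues of the centered empirical covariance of the
    Nystrom features, and the projections (scores) of the n samples.
    """
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    idx = rng.choice(n, size=m, replace=False)  # uniform subsampling (assumption)
    landmarks = X[idx]

    K_mm = gaussian_kernel(landmarks, landmarks, bandwidth)  # m x m block
    K_nm = gaussian_kernel(X, landmarks, bandwidth)          # n x m block

    # Inverse square root factor of K_mm, dropping near-zero eigenvalues for stability.
    w, V = np.linalg.eigh(K_mm)
    keep = w > eps * w.max()
    inv_sqrt = V[:, keep] / np.sqrt(w[keep])                 # m x r

    # Nystrom features: features @ features.T equals K_nm K_mm^+ K_mn.
    features = K_nm @ inv_sqrt                               # n x r

    # Ordinary PCA on the features; only an r x r eigenproblem is solved.
    centered = features - features.mean(axis=0)
    cov = centered.T @ centered / n
    evals, evecs = np.linalg.eigh(cov)
    order = np.argsort(evals)[::-1][:n_components]
    return evals[order], centered @ evecs[:, order]
```

In this sketch the n x n kernel matrix is never formed: the cost is dominated by the n x m kernel block and an eigenproblem of size at most m, which is where the computational benefit comes from when m is much smaller than n. How large m must be to match the statistical performance of exact KPCA is the question the paper analyzes; the sketch makes no claim about that.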

research
06/20/2017

Statistical Consistency of Kernel PCA with Random Features

Kernel methods are powerful learning methodologies that provide a simple...
research
12/11/2019

Large-scale Kernel Methods and Applications to Lifelong Robot Learning

As the size and richness of available datasets grow larger, the opportun...
research
07/11/2019

Gain with no Pain: Efficient Kernel-PCA by Nyström Sampling

In this paper, we propose and study a Nyström based approach to efficien...
research
10/11/2018

On Kernel Derivative Approximation with Random Fourier Features

Random Fourier features (RFF) represent one of the most popular and wide...
research
09/21/2014

Approximation errors of online sparsification criteria

Many machine learning frameworks, such as resource-allocating networks, ...
research
06/18/2012

A Linear Approximation to the chi^2 Kernel with Geometric Convergence

We propose a new analytical approximation to the χ^2 kernel that converg...
research
04/10/2017

On the Fine-Grained Complexity of Empirical Risk Minimization: Kernel Methods and Neural Networks

Empirical risk minimization (ERM) is ubiquitous in machine learning and ...