Streaming Kernel Principal Component Analysis

12/16/2015
by   Mina Ghashami, et al.
0

Kernel principal component analysis (KPCA) provides a concise set of basis vectors which capture non-linear structures within large data sets, and is a central tool in data analysis and learning. To allow for non-linear relations, typically a full n × n kernel matrix is constructed over n data points, but this requires too much space and time for large values of n. Techniques such as the Nyström method and random feature maps can help towards this goal, but they do not explicitly maintain the basis vectors in a stream and take more space than desired. We propose a new approach for streaming KPCA which maintains a small set of basis elements in a stream, requiring space only logarithmic in n, and also improves the dependence on the error parameter. Our technique combines together random feature maps with recent advances in matrix sketching, it has guaranteed spectral norm error bounds with respect to the original kernel matrix, and it compares favorably in practice to state-of-the-art approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/20/2015

Kernel principal component analysis network for image classification

In order to classify the nonlinear feature with linear classifier and im...
research
08/02/2018

Streaming Kernel PCA with Õ(√(n)) Random Features

We study the statistical and computational aspects of kernel principal c...
research
02/12/2020

Structure-Property Maps with Kernel Principal Covariates Regression

Data analysis based on linear methods, which look for correlations betwe...
research
10/22/2012

Initialization of Self-Organizing Maps: Principal Components Versus Random Initialization. A Case Study

The performance of the Self-Organizing Map (SOM) algorithm is dependent ...
research
01/02/2017

Towards multiple kernel principal component analysis for integrative analysis of tumor samples

Personalized treatment of patients based on tissue-specific cancer subty...
research
02/18/2020

Learning Bijective Feature Maps for Linear ICA

Separating high-dimensional data like images into independent latent fac...
research
06/12/2023

Kernel Random Projection Depth for Outlier Detection

This paper proposes an extension of Random Projection Depth (RPD) to cop...

Please sign up or login with your details

Forgot password? Click here to reset