Streaming Kernel PCA Algorithm With Small Space

03/08/2023
by   Yichuan Deng, et al.
0

Principal Component Analysis (PCA) is a widely used technique in machine learning, data analysis and signal processing. With the increase in the size and complexity of datasets, it has become important to develop low-space usage algorithms for PCA. Streaming PCA has gained significant attention in recent years, as it can handle large datasets efficiently. The kernel method, which is commonly used in learning algorithms such as Support Vector Machines (SVMs), has also been applied in PCA algorithms. We propose a streaming algorithm for Kernel PCA problems based on the traditional scheme by Oja. Our algorithm addresses the challenge of reducing the memory usage of PCA while maintaining its accuracy. We analyze the performance of our algorithm by studying the conditions under which it succeeds. Specifically, we show that, when the spectral ratio R := λ_1/λ_2 of the target covariance matrix is lower bounded by C ·log n·log d, the streaming PCA can be solved with O(d) space cost. Our proposed algorithm has several advantages over existing methods. First, it is a streaming algorithm that can handle large datasets efficiently. Second, it employs the kernel method, which allows it to capture complex nonlinear relationships among data points. Third, it has a low-space usage, making it suitable for applications where memory is limited.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/15/2018

History PCA: A New Algorithm for Streaming PCA

In this paper we propose a new algorithm for streaming principal compone...
research
12/21/2016

Robust Learning with Kernel Mean p-Power Error Loss

Correntropy is a second order statistical measure in kernel space, which...
research
01/31/2018

Incremental kernel PCA and the Nyström method

Incremental versions of batch algorithms are often desired, for increase...
research
05/15/2011

Spectrum Sensing for Cognitive Radio Using Kernel-Based Learning

Kernel method is a very powerful tool in machine learning. The trick of ...
research
05/27/2023

Improved Privacy-Preserving PCA Using Space-optimized Homomorphic Matrix Multiplication

Principal Component Analysis (PCA) is a pivotal technique in the fields ...
research
02/23/2016

An Improved Gap-Dependency Analysis of the Noisy Power Method

We consider the noisy power method algorithm, which has wide application...
research
06/12/2018

Streaming PCA and Subspace Tracking: The Missing Data Case

For many modern applications in science and engineering, data are collec...

Please sign up or login with your details

Forgot password? Click here to reset