Sketching for Principal Component Regression

03/07/2018
by Liron Mor-Yosef, et al.

Principal component regression (PCR) is a useful method for regularizing linear regression. Although conceptually simple, straightforward implementations of PCR have high computational costs and so are inappropriate when learning with large-scale data. In this paper, we propose efficient algorithms for computing approximate PCR solutions that, on the one hand, are high-quality approximations to the true PCR solutions (when viewed as minimizers of a constrained optimization problem) and, on the other hand, admit rigorous risk bounds (when viewed as statistical estimators). In particular, we propose an input-sparsity-time algorithm for approximate PCR. We also consider computing an approximate PCR solution in the streaming model, as well as kernel PCR. Empirical results demonstrate the excellent performance of our proposed methods.
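
For readers unfamiliar with PCR, the following is a minimal sketch of the exact (unsketched) baseline that the abstract describes as computationally expensive. It is not the paper's algorithm. It assumes an uncentered data matrix, and the names `pcr_fit`, `X`, `y`, and `k` are illustrative choices rather than the authors' notation. The regression is restricted to the span of the top `k` right singular vectors of `X`, and the full SVD it relies on is the costly step that approximate methods aim to avoid.

```python
import numpy as np

def pcr_fit(X, y, k):
    """Exact PCR: least squares restricted to the top-k principal subspace.

    Hypothetical helper for illustration; assumes the top-k singular
    values of X are strictly positive.
    """
    # Thin SVD of the data matrix: X = U @ diag(s) @ Vt
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    # Coefficients in the k-dimensional projected space
    z = (U[:, :k].T @ y) / s[:k]
    # Map back to the original d-dimensional feature space
    return Vt[:k].T @ z

# Small synthetic example (made-up data, for illustration only)
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 10))
y = X @ rng.standard_normal(10) + 0.01 * rng.standard_normal(200)

w_k = pcr_fit(X, y, k=3)      # regularized: uses only 3 components
w_full = pcr_fit(X, y, k=10)  # k = d recovers ordinary least squares
```

With `k` equal to the number of features, the result coincides with ordinary least squares; smaller `k` discards the low-variance directions, which is the source of the regularization.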

research
11/07/2018

A note on the prediction error of principal component regression

We analyse the prediction error of principal component regression (PCR) ...
research
12/09/2022

A note on the prediction error of principal component regression in high dimensions

We analyze the prediction error of principal component regression (PCR) ...
research
10/15/2019

Principal Component Projection and Regression in Nearly Linear Time through Asymmetric SVRG

Given a data matrix A∈R^n × d, principal component projection (PCP) and ...
research
03/22/2021

Supervised Principal Component Regression for Functional Response with High Dimensional Predictors

We propose a supervised principal component regression method for relati...
research
01/03/2023

Diffusion approximations of Oja's online principal component analysis

Oja's algorithm of principal component analysis (PCA) has been one of th...
research
10/06/2021

Boosting RANSAC via Dual Principal Component Pursuit

In this paper, we revisit the problem of local optimization in RANSAC. O...
research
12/26/2018

Large Multistream Data Analytics for Monitoring and Diagnostics in Manufacturing Systems

The high-dimensionality and volume of large scale multistream data has i...
