How many variables should be entered in a principal component regression equation?

06/04/2019
by   Ji Xu, et al.
6

We study least squares linear regression over N uncorrelated Gaussian features that are selected in order of decreasing variance. When the number of selected features p is at most the sample size n, the estimator under consideration coincides with the principal component regression estimator; when p>n, the estimator is the least ℓ_2 norm solution over the selected features. We give an average-case analysis of the out-of-sample prediction error as p,n,N →∞ with p/N →α and n/N →β, for some constants α∈ [0,1] and β∈ (0,1). In this average-case setting, the prediction error exhibits a `double descent' shape as a function of p.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/04/2023

A note on the variance in principal component regression

Principal component regression is a popular method to use when the predi...
research
10/27/2020

On Principal Component Regression in a High-Dimensional Error-in-Variables Setting

We analyze the classical method of Principal Component Regression (PCR) ...
research
07/03/2023

Adaptive Principal Component Regression with Applications to Panel Data

Principal component regression (PCR) is a popular technique for fixed-de...
research
11/16/2017

An Efficient Bayesian Robust Principal Component Regression

Principal component regression is a linear regression model with princip...
research
09/03/2022

Forbidden Knowledge and Specialized Training: A Versatile Solution for the Two Main Sources of Overfitting in Linear Regression

Overfitting in linear regression is broken down into two main causes. Fi...
research
11/27/2019

A race-DC in Big Data

The strategy of divide-and-combine (DC) has been widely used in the area...
research
01/28/2019

Secure multi-party linear regression at plaintext speed

We detail a scheme for scalable, distributed, secure multiparty linear r...

Please sign up or login with your details

Forgot password? Click here to reset