Prediction in latent factor regression: Adaptive PCR and beyond

07/20/2020
by   Xin Bing, et al.
0

This work is devoted to the finite sample prediction risk analysis of a class of linear predictors of a response Y∈ℝ from a high-dimensional random vector X∈ℝ^p when (X,Y) follows a latent factor regression model generated by a unobservable latent vector Z of dimension less than p. Our primary contribution is in establishing finite sample risk bounds for prediction with the ubiquitous Principal Component Regression (PCR) method, under the factor regression model, with the number of principal components adaptively selected from the data—a form of theoretical guarantee that is surprisingly lacking from the PCR literature. To accomplish this, we prove a master theorem that establishes a risk bound for a large class of predictors, including the PCR predictor as a special case. This approach has the benefit of providing a unified framework for the analysis of a wide range of linear prediction methods, under the factor regression setting. In particular, we use our main theorem to recover known risk bounds for the minimum-norm interpolating predictor, which has received renewed attention in the past two years, and a prediction method tailored to a subclass of factor regression models with identifiable parameters. This model-tailored method can be interpreted as prediction via clusters with latent centers. To address the problem of selecting among a set of candidate predictors, we analyze a simple model selection procedure based on data-splitting, providing an oracle inequality under the factor model to prove that the performance of the selected predictor is close to the optimal candidate. We conclude with a detailed simulation study to support and complement our theoretical results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/06/2020

Interpolation under latent factor regression models

This work studies finite-sample properties of the risk of the minimum-no...
research
11/09/2021

Function-on-function linear quantile regression

In this study, we propose a function-on-function linear quantile regress...
research
03/18/2019

Score predictor factor analysis: Reproducing observed covariances by means of factor score predictors

The model implied by factor score predictors does not reproduce the non-...
research
06/10/2019

Selection consistency of Lasso-based procedures for misspecified high-dimensional binary model and random regressors

We consider selection of random predictors for high-dimensional regressi...
research
05/28/2022

Low-rank Latent Matrix Factor-Analysis Modeling for Generalized Linear Regression with High-dimensional Imaging Biomarkers

Medical imaging has been recognized as a phenotype associated with vario...
research
11/16/2017

An Efficient Bayesian Robust Principal Component Regression

Principal component regression is a linear regression model with princip...
research
10/31/2017

Consistency of Generalized Dynamic Principal Components in Dynamic Factor Models

We study the theoretical properties of the generalized dynamic principal...

Please sign up or login with your details

Forgot password? Click here to reset