`Basic' Generalization Error Bounds for Least Squares Regression with Well-specified Models

09/20/2021
by   Karthik Duraisamy, et al.
0

This note examines the behavior of generalization capabilities - as defined by out-of-sample mean squared error (MSE) - of Linear Gaussian (with a fixed design matrix) and Linear Least Squares regression. Particularly, we consider a well-specified model setting, i.e. we assume that there exists a `true' combination of model parameters within the chosen model form. While the statistical properties of Least Squares regression have been extensively studied over the past few decades - particularly with less restrictive problem statements compared to the present work - this note targets bounds that are non-asymptotic and more quantitative compared to the literature. Further, the analytical formulae for distributions and bounds (on the MSE) are directly compared to numerical experiments. Derivations are presented in a self-contained and pedagogical manner, in a way that a reader with a basic knowledge of probability and statistics can follow.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/07/2023

A note on the optimum allocation of resources to follow up unit nonrespondents in probability

Common practice to address nonresponse in probability surveys in Nationa...
research
04/10/2018

Subsampled Optimization: Statistical Guarantees, Mean Squared Error Approximation, and Sampling Method

For optimization on large-scale data, exactly calculating its solution m...
research
12/11/2018

Distribution-free properties of isotonic regression

It is well known that the isotonic least squares estimator is characteri...
research
05/10/2021

A rigorous introduction for linear models

This note is meant to provide an introduction to linear models and the t...
research
12/30/2019

All-or-Nothing Phenomena: From Single-Letter to High Dimensions

We consider the linear regression problem of estimating a p-dimensional ...
research
05/13/2023

A note on bounded distance-based information loss metrics for statistical disclosure control of numeric microdata

In the field of statistical disclosure control, the tradeoff between dat...
research
11/27/2022

Generalizing Gaussian Smoothing for Random Search

Gaussian smoothing (GS) is a derivative-free optimization (DFO) algorith...

Please sign up or login with your details

Forgot password? Click here to reset