Randomized Least Squares Regression: Combining Model- and Algorithm-Induced Uncertainties

08/17/2018
by   Jocelyn T. Chi, et al.
0

We analyze the uncertainties in the minimum norm solution of full-rank regression problems, arising from Gaussian linear models, computed by randomized (row-wise sampling and, more generally, sketching) algorithms. From a deterministic perspective, our structural perturbation bounds imply that least squares problems are less sensitive to multiplicative perturbations than to additive perturbations. From a probabilistic perspective, our expressions for the total expectation and variance with regard to both model- and algorithm-induced uncertainties, are exact, hold for general sketching matrices, and make no assumptions on the rank of the sketched matrix. The relative differences between the total bias and variance on the one hand, and the model bias and variance on the other hand, are governed by two factors: (i) the expected rank deficiency of the sketched matrix, and (ii) the expected difference between projectors associated with the original and the sketched problems. A simple example, based on uniform sampling with replacement, illustrates the statistical quantities.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/20/2019

Uniform bounds for invariant subspace perturbations

For a fixed matrix A and perturbation E we develop purely deterministic ...
research
09/27/2019

Total Least Squares Regression in Input Sparsity Time

In the total least squares problem, one is given an m × n matrix A, and ...
research
12/31/2019

A doubly stochastic block Gauss-Seidel algorithm for solving linear equations

We propose a simple doubly stochastic block Gauss-Seidel algorithm for s...
research
06/22/2021

Efficient recursive least squares solver for rank-deficient matrices

Updating a linear least squares solution can be critical for near real-t...
research
06/23/2013

A Statistical Perspective on Algorithmic Leveraging

One popular method for dealing with large-scale data sets is sampling. F...
research
07/12/2020

Multiplicative Perturbation Bounds for Multivariate Multiple Linear Regression in Schatten p-Norms

Multivariate multiple linear regression (MMLR), which occurs in a number...
research
03/05/2019

Reduced-rank Analysis of the Total Least Squares

The reduced-rank method exploits the distortion-variance tradeoff to yie...

Please sign up or login with your details

Forgot password? Click here to reset