Sharpened Error Bounds for Random Sampling Based ℓ_2 Regression

03/30/2014
by   Shusen Wang, et al.
0

Given a data matrix X ∈ R^n× d and a response vector y ∈ R^n, suppose n>d, it costs O(n d^2) time and O(n d) space to solve the least squares regression (LSR) problem. When n and d are both large, exactly solving the LSR problem is very expensive. When n ≫ d, one feasible approach to speeding up LSR is to randomly embed y and all columns of X into a smaller subspace R^c; the induced LSR problem has the same number of columns but much fewer number of rows, and it can be solved in O(c d^2) time and O(c d) space. We discuss in this paper two random sampling based methods for solving LSR more efficiently. Previous work showed that the leverage scores based sampling based LSR achieves 1+ϵ accuracy when c ≥ O(d ϵ^-2 d). In this paper we sharpen this error bound, showing that c = O(d d + d ϵ^-1) is enough for achieving 1+ϵ accuracy. We also show that when c ≥ O(μ d ϵ^-2 d), the uniform sampling based LSR attains a 2+ϵ bound with positive probability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/18/2022

Optimal Algorithms for Linear Algebra in the Current Matrix Multiplication Time

We study fundamental problems in linear algebra, such as finding a maxim...
research
12/01/2020

Time-Space Lower Bounds for Simulating Proof Systems with Quantum and Randomized Verifiers

A line of work initiated by Fortnow in 1997 has proven model-independent...
research
05/19/2021

L1 Regression with Lewis Weights Subsampling

We consider the problem of finding an approximate solution to ℓ_1 regres...
research
01/08/2020

Stability of Sampling for CUR Decompositions

This article studies how to form CUR decompositions of low-rank matrices...
research
05/04/2015

An Explicit Sampling Dependent Spectral Error Bound for Column Subset Selection

In this paper, we consider the problem of column subset selection. We pr...
research
06/01/2023

Sharper Bounds for ℓ_p Sensitivity Sampling

In large scale machine learning, random sampling is a popular way to app...
research
11/23/2022

Filtering for Anderson acceleration

This work introduces, analyzes and demonstrates an efficient and theoret...

Please sign up or login with your details

Forgot password? Click here to reset