An Explicit Sampling Dependent Spectral Error Bound for Column Subset Selection

05/04/2015
by   Tianbao Yang, et al.
0

In this paper, we consider the problem of column subset selection. We present a novel analysis of the spectral norm reconstruction for a simple randomized algorithm and establish a new bound that depends explicitly on the sampling probabilities. The sampling dependent error bound (i) allows us to better understand the tradeoff in the reconstruction error due to sampling probabilities, (ii) exhibits more insights than existing error bounds that exploit specific probability distributions, and (iii) implies better sampling distributions. In particular, we show that a sampling distribution with probabilities proportional to the square root of the statistical leverage scores is always better than uniform sampling and is better than leverage-based sampling when the statistical leverage scores are very nonuniform. And by solving a constrained optimization problem related to the error bound with an efficient bisection search we are able to achieve better performance than using either the leverage-based distribution or that proportional to the square root of the statistical leverage scores. Numerical simulations demonstrate the benefits of the new sampling distributions for low-rank matrix approximation and least square approximation compared to state-of-the art algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

page 9

research
06/12/2013

Completing Any Low-rank Matrix, Provably

Matrix completion, i.e., the exact and provable recovery of a low-rank m...
research
10/14/2014

Tighter Low-rank Approximation via Sampling the Leveraged Element

In this work, we propose a new randomized algorithm for computing a low-...
research
04/06/2014

Provable Deterministic Leverage Score Sampling

We explain theoretically a curious empirical phenomenon: "Approximating ...
research
03/22/2015

Relaxed Leverage Sampling for Low-rank Matrix Completion

We consider the problem of exact recovery of any m× n matrix of rank ϱ f...
research
05/21/2018

Relating Leverage Scores and Density using Regularized Christoffel Functions

Statistical leverage scores emerged as a fundamental tool for matrix ske...
research
03/30/2014

Sharpened Error Bounds for Random Sampling Based ℓ_2 Regression

Given a data matrix X ∈ R^n× d and a response vector y ∈ R^n, suppose n>...
research
03/15/2018

Ridge Regression and Provable Deterministic Ridge Leverage Score Sampling

Ridge leverage scores provide a balance between low-rank approximation a...

Please sign up or login with your details

Forgot password? Click here to reset