On the Estimation of Coherence

09/04/2010
by   Mehryar Mohri, et al.
0

Low-rank matrix approximations are often used to help scale standard machine learning algorithms to large-scale problems. Recently, matrix coherence has been used to characterize the ability to extract global information from a subset of matrix entries in the context of these low-rank approximations and other sampling-based algorithms, e.g., matrix com- pletion, robust PCA. Since coherence is defined in terms of the singular vectors of a matrix and is expensive to compute, the practical significance of these results largely hinges on the following question: Can we efficiently and accurately estimate the coherence of a matrix? In this paper we address this question. We propose a novel algorithm for estimating coherence from a small number of columns, formally analyze its behavior, and derive a new coherence-based matrix approximation bound based on this analysis. We then present extensive experimental results on synthetic and real datasets that corroborate our worst-case theoretical analysis, yet provide strong support for the use of our proposed algorithm whenever low-rank approximation is being considered. Our algorithm efficiently and accurately estimates matrix coherence across a wide range of datasets, and these coherence estimates are excellent predictors of the effectiveness of sampling-based matrix approximation on a case-by-case basis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/09/2014

Matrix Coherence and the Nystrom Method

The Nystrom method is an efficient technique used to speed up large-scal...
research
10/16/2017

Nonsmooth Frank-Wolfe using Uniform Affine Approximations

Frank-Wolfe methods (FW) have gained significant interest in the machine...
research
11/11/2021

On Recovering the Best Rank-r Approximation from Few Entries

In this note, we investigate how well we can reconstruct the best rank-r...
research
11/17/2021

A quantum-inspired algorithm for approximating statistical leverage scores

Suppose a matrix A ∈ℝ^m × n of rank k with singular value decomposition ...
research
03/22/2014

CUR Algorithm with Incomplete Matrix Observation

CUR matrix decomposition is a randomized algorithm that can efficiently ...
research
09/22/2022

Randomized low-rank approximation of monotone matrix functions

This work is concerned with computing low-rank approximations of a matri...
research
02/21/2020

Improved guarantees and a multiple-descent curve for the Column Subset Selection Problem and the Nyström method

The Column Subset Selection Problem (CSSP) and the Nyström method are am...

Please sign up or login with your details

Forgot password? Click here to reset