On randomized sketching algorithms and the Tracy-Widom law

01/03/2022
by Daniel Ahfock, et al.

There is an increasing body of work exploring the integration of random projection into algorithms for numerical linear algebra. The primary motivation is to reduce the overall computational cost of processing large datasets. A suitably chosen random projection can be used to embed the original dataset in a lower-dimensional space such that key properties of the original dataset are retained. These algorithms are often referred to as sketching algorithms, as the projected dataset can be used as a compressed representation of the full dataset. We show that random matrix theory, in particular the Tracy-Widom law, is useful for describing the operating characteristics of sketching algorithms in the tall-data regime when n ≫ d. Asymptotic large sample results are of particular interest as this is the regime where sketching is most useful for data compression. In particular, we develop asymptotic approximations for the success rate in generating random subspace embeddings and the convergence probability of iterative sketching algorithms. We test a number of sketching algorithms on real large high-dimensional datasets and find that the asymptotic expressions give accurate predictions of the empirical performance.
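
The idea of a random subspace embedding, and of its empirical success rate, can be illustrated with a short simulation. The sketch below is only a minimal illustration under stated assumptions: it uses a Gaussian sketching matrix with i.i.d. N(0, 1/k) entries (one common choice, not necessarily the one studied in the paper), it assumes the data matrix has full column rank, and it measures distortion in squared norms. The function names `gaussian_sketch_matrix`, `embedding_distortion`, and `empirical_success_rate` are hypothetical helpers introduced here. It estimates the success rate by Monte Carlo and does not implement the asymptotic Tracy-Widom approximation itself.

```python
import numpy as np


def gaussian_sketch_matrix(k, n, rng):
    """Draw a k x n sketching matrix with i.i.d. N(0, 1/k) entries (assumed sketch type)."""
    return rng.standard_normal((k, n)) / np.sqrt(k)


def embedding_distortion(A, S):
    """Return max |lambda - 1| over the eigenvalues of (SQ)^T (SQ).

    Q is an orthonormal basis for the column space of A (A is assumed to have
    full column rank). S satisfies
        (1 - eps) * ||Ax||^2 <= ||SAx||^2 <= (1 + eps) * ||Ax||^2  for all x
    exactly when the returned value is at most eps.
    """
    Q, _ = np.linalg.qr(A)  # reduced QR: Q is n x d with orthonormal columns
    SQ = S @ Q
    eigvals = np.linalg.eigvalsh(SQ.T @ SQ)
    return float(np.max(np.abs(eigvals - 1.0)))


def empirical_success_rate(A, k, eps, trials, seed=0):
    """Fraction of independent sketches that are eps-subspace embeddings for A."""
    rng = np.random.default_rng(seed)
    n = A.shape[0]
    hits = 0
    for _ in range(trials):
        S = gaussian_sketch_matrix(k, n, rng)
        if embedding_distortion(A, S) <= eps:
            hits += 1
    return hits / trials


if __name__ == "__main__":
    # Tall synthetic data (n >> d); the sizes here are arbitrary illustrative choices.
    data_rng = np.random.default_rng(42)
    A = data_rng.standard_normal((2000, 5))
    for eps in (0.2, 0.4, 0.6):
        rate = empirical_success_rate(A, k=200, eps=eps, trials=50)
        print(f"eps={eps:.1f}  empirical success rate={rate:.2f}")
```

Because the distortion depends only on the extreme eigenvalues of a small d x d matrix, this kind of simulation is the empirical quantity that an asymptotic approximation of the success rate would be compared against.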

research
04/17/2020

Asymptotic Analysis of an Ensemble of Randomly Projected Linear Discriminants

Datasets from the fields of bioinformatics, chemometrics, and face recog...
research
09/22/2021

Randomized Projection Learning Method for Dynamic Mode Decomposition

A data-driven analysis method known as dynamic mode decomposition (DMD) ...
research
10/14/2018

A New Theory for Sketching in Linear Regression

Large datasets create opportunities as well as analytic challenges. A re...
research
09/08/2021

Functional Principal Subspace Sampling for Large Scale Functional Data Analysis

Functional data analysis (FDA) methods have computational and theoretica...
research
11/06/2019

Faster Least Squares Optimization

We investigate randomized methods for solving overdetermined linear leas...
research
03/14/2018

Optimal Bounds for Johnson-Lindenstrauss Transformations

In 1984, Johnson and Lindenstrauss proved that any finite set of data in...
research
04/29/2014

Randomized Sketches of Convex Programs with Sharp Guarantees

Random projection (RP) is a classical technique for reducing storage and...
