Generalization Bounds for Data-Driven Numerical Linear Algebra

06/16/2022
by Peter Bartlett, et al.

Data-driven algorithms can adapt their internal structure or parameters to inputs from unknown application-specific distributions, by learning from a training sample of inputs. Several recent works have applied this approach to problems in numerical linear algebra, obtaining significant empirical gains in performance. However, no theoretical explanation for their success was known. In this work we prove generalization bounds for those algorithms, within the PAC-learning framework for data-driven algorithm selection proposed by Gupta and Roughgarden (SICOMP 2017). Our main results are closely matching upper and lower bounds on the fat shattering dimension of the learning-based low rank approximation algorithm of Indyk et al. (NeurIPS 2019). Our techniques are general, and provide generalization bounds for many other recently proposed data-driven algorithms in numerical linear algebra, covering both sketching-based and multigrid-based methods. This considerably broadens the class of data-driven algorithms for which a PAC-learning analysis is available.
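
To make the learning setup concrete, the following is a minimal sketch of sketching-based low rank approximation with a parameterized sketching matrix, together with the empirical risk one would minimize over a training sample. The abstract does not spell out the algorithm, so this assumes the standard sketch-and-project pipeline (project onto the row space of SA, then truncate to rank k). The function names, the Frobenius-norm loss, and the random sign matrix standing in for a learned sketch are illustrative assumptions, not the authors' code.

```python
import numpy as np

def sketched_low_rank(A, S, k):
    """Rank-k approximation of A restricted to the row space of S @ A.

    A: (n, d) input matrix, S: (m, n) sketching matrix (the learnable
    parameter in the data-driven setting), k: target rank.
    """
    SA = S @ A                               # (m, d) sketch of A
    # Orthonormal columns whose span contains the row space of SA.
    Q, _ = np.linalg.qr(SA.T)                # Q has shape (d, min(d, m))
    AQ = A @ Q                               # rows of A in the basis Q
    U, s, Vt = np.linalg.svd(AQ, full_matrices=False)
    AQ_k = (U[:, :k] * s[:k]) @ Vt[:k]       # best rank-k approximation of AQ
    return AQ_k @ Q.T                        # map back to the original space

def loss(A, S, k):
    """Squared Frobenius error of the sketched approximation (assumed loss)."""
    return float(np.linalg.norm(A - sketched_low_rank(A, S, k), "fro") ** 2)

def empirical_risk(samples, S, k):
    """Average loss of a fixed sketch S over a training sample of matrices."""
    return float(np.mean([loss(A, S, k) for A in samples]))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    samples = [rng.standard_normal((20, 10)) for _ in range(5)]
    S = rng.choice([-1.0, 1.0], size=(5, 20))  # hypothetical stand-in for a learned sketch
    print(empirical_risk(samples, S, k=2))
```

In this picture, a generalization bound controls how far `empirical_risk` on the training sample can deviate from the expected loss on the unknown input distribution, uniformly over all candidate sketches `S`.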
