Samplets: A new paradigm for data compression

07/07/2021
by   Helmut Harbrecht, et al.
0

In this article, we introduce the concept of samplets by transferring the construction of Tausch-White wavelets to the realm of data. This way we obtain a multilevel representation of discrete data which directly enables data compression, detection of singularities and adaptivity. Applying samplets to represent kernel matrices, as they arise in kernel based learning or Gaussian process regression, we end up with quasi-sparse matrices. By thresholding small entries, these matrices are compressible to O(N log N) relevant entries, where N is the number of data points. This feature allows for the use of fill-in reducing reorderings to obtain a sparse factorization of the compressed matrices. Besides the comprehensive introduction to samplets and their properties, we present extensive numerical studies to benchmark the approach. Our results demonstrate that samplets mark a considerable step in the direction of making large data sets accessible for analysis.

READ FULL TEXT

page 10

page 21

research
02/14/2017

Mutual Kernel Matrix Completion

With the huge influx of various data nowadays, extracting knowledge from...
research
11/21/2022

Multiresolution kernel matrix algebra

We propose a sparse arithmetic for kernel matrices, enabling efficient s...
research
06/04/2022

Data-driven Construction of Hierarchical Matrices with Nested Bases

Hierarchical matrices provide a powerful representation for significantl...
research
04/05/2016

Fast methods for training Gaussian processes on large data sets

Gaussian process regression (GPR) is a non-parametric Bayesian technique...
research
01/27/2023

Data sparse multilevel covariance estimation in optimal complexity

We consider the ℋ^2-formatted compression and computational estimation o...
research
02/01/2012

Kernels on Sample Sets via Nonparametric Divergence Estimates

Most machine learning algorithms, such as classification or regression, ...
research
07/21/2023

Sparse Cholesky factorization by greedy conditional selection

Dense kernel matrices resulting from pairwise evaluations of a kernel fu...

Please sign up or login with your details

Forgot password? Click here to reset