Inv-ASKIT: A Parallel Fast Direct Solver for Kernel Matrices

02/03/2016
by   Chenhan D. Yu, et al.

We present a parallel algorithm for computing the approximate factorization of an N-by-N kernel matrix. Once this factorization has been constructed (with N log^2 N work), we can solve linear systems with this matrix with N log N work. Kernel matrices represent pairwise interactions of points in metric spaces. They appear in machine learning, approximation theory, and computational physics. Kernel matrices are typically dense (matrix multiplication scales quadratically with N) and ill-conditioned (solves can require hundreds of Krylov iterations). Thus, fast algorithms for matrix multiplication and factorization are critical for scalability. Recently we introduced ASKIT, a new method for approximating a kernel matrix that resembles N-body methods. Here we introduce INV-ASKIT, a factorization scheme based on ASKIT. We describe the new method, derive complexity estimates, and conduct an empirical study of its accuracy and scalability. We report results on real-world datasets including "COVTYPE" (0.5M points in 54 dimensions), "SUSY" (4.5M points in 8 dimensions) and "MNIST" (2M points in 784 dimensions) using shared and distributed memory parallelism. In our largest run we approximately factorize a dense matrix of size 32M × 32M (generated from points in 64 dimensions) on 4,096 Sandy-Bridge cores. To our knowledge these results improve the state of the art by several orders of magnitude.
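To make the quadratic cost mentioned in the abstract concrete, here is a minimal sketch of the direct baseline that fast methods aim to avoid: forming a dense kernel matrix and multiplying it by a vector. It is not the ASKIT or INV-ASKIT algorithm. The Gaussian kernel, the bandwidth value, the random points, and the function names `gaussian_kernel_matrix` and `matvec` are all assumptions chosen for illustration.

```python
import math
import random


def gaussian_kernel_matrix(points, bandwidth):
    """Dense N-by-N matrix with K[i][j] = exp(-||x_i - x_j||^2 / (2 h^2)).

    The Gaussian kernel is an assumed example; the abstract does not
    fix a particular kernel function.
    """
    n = len(points)
    K = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            sq_dist = sum((a - b) ** 2 for a, b in zip(points[i], points[j]))
            # The matrix is symmetric, so fill both triangles at once.
            K[i][j] = K[j][i] = math.exp(-sq_dist / (2.0 * bandwidth ** 2))
    return K


def matvec(K, w):
    """Direct matrix-vector product: N^2 multiply-adds for an N-by-N matrix."""
    return [sum(k_ij * w_j for k_ij, w_j in zip(row, w)) for row in K]


random.seed(0)
# 50 hypothetical points in 3 dimensions; real datasets are far larger.
points = [[random.random() for _ in range(3)] for _ in range(50)]
K = gaussian_kernel_matrix(points, bandwidth=0.5)
w = [1.0] * len(points)
u = matvec(K, w)
```

Every entry of `K` is nonzero, so both storage and the product grow as N^2. At the 32M-point scale quoted above, the full matrix could not be stored explicitly, which is why an approximate factorization is needed.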

