A Survey of Singular Value Decomposition Methods for Distributed Tall/Skinny Data

09/02/2020
by   Drew Schmidt, et al.
0

The Singular Value Decomposition (SVD) is one of the most important matrix factorizations, enjoying a wide variety of applications across numerous application domains. In statistics and data analysis, the common applications of SVD such as Principal Components Analysis (PCA) and linear regression. Usually these applications arise on data that has far more rows than columns, so-called "tall/skinny" matrices. In the big data analytics context, this may take the form of hundreds of millions to billions of rows with only a few hundred columns. There is a need, therefore, for fast, accurate, and scalable tall/skinny SVD implementations which can fully utilize modern computing resources. To that end, we present a survey of three different algorithms for computing the SVD for these kinds of tall/skinny data layouts using MPI for communication. We contextualize these with common big data analytics techniques, principally PCA. Finally, we present both CPU and GPU timing results from the Summit supercomputer, and discuss possible alternative approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/28/2019

FameSVD: Fast and Memory-efficient Singular Value Decomposition

We propose a novel algorithm to perform the Singular Value Decomposition...
research
10/21/2022

Robust Singular Values based on L1-norm PCA

Singular-Value Decomposition (SVD) is a ubiquitous data analysis method ...
research
04/13/2018

Regularized Singular Value Decomposition and Application to Recommender System

Singular value decomposition (SVD) is the mathematical basis of principa...
research
12/08/2020

Sparse Correspondence Analysis for Contingency Tables

Since the introduction of the lasso in regression, various sparse method...
research
12/29/2016

Selecting Bases in Spectral learning of Predictive State Representations via Model Entropy

Predictive State Representations (PSRs) are powerful techniques for mode...
research
07/27/2023

PCA, SVD, and Centering of Data

The research detailed in this paper scrutinizes Principal Component Anal...
research
08/15/2023

Ternary Singular Value Decomposition as a Better Parameterized Form in Linear Mapping

We present a simple yet novel parameterized form of linear mapping to ac...

Please sign up or login with your details

Forgot password? Click here to reset