Stochastic Subsampling for Factorizing Huge Matrices

01/19/2017
by   Arthur Mensch, et al.
0

We present a matrix-factorization algorithm that scales to input matrices with both huge number of rows and columns. Learned factors may be sparse or dense and/or non-negative, which makes our algorithm suitable for dictionary learning, sparse component analysis, and non-negative matrix factorization. Our algorithm streams matrix columns while subsampling them to iteratively learn the matrix factors. At each iteration, the row dimension of a new sample is reduced by subsampling, resulting in lower time complexity compared to a simple streaming algorithm. Our method comes with convergence guarantees to reach a stationary point of the matrix-factorization problem. We demonstrate its efficiency on massive functional Magnetic Resonance Imaging data (2 TB), and on patches extracted from hyperspectral images (103 GB). For both problems, which involve different penalties on rows and columns, we obtain significant speed-ups compared to state-of-the-art algorithms.

READ FULL TEXT
research
05/03/2016

Dictionary Learning for Massive Matrix Factorization

Sparse matrix factorization is a popular tool to obtain interpretable da...
research
02/04/2011

A convex model for non-negative matrix factorization and dimensionality reduction on physical space

A collaborative convex framework for factoring a data matrix X into a no...
research
05/19/2017

A Unified Framework for Stochastic Matrix Factorization via Variance Reduction

We propose a unified framework to speed up the existing stochastic matri...
research
08/06/2018

A Survey on Surrogate Approaches to Non-negative Matrix Factorization

Motivated by applications in hyperspectral imaging we investigate method...
research
04/10/2020

The Permuted Striped Block Model and its Factorization – Algorithms with Recovery Guarantees

We introduce a novel class of matrices which are defined by the factoriz...
research
04/15/2018

A Sparse Non-negative Matrix Factorization Framework for Identifying Functional Units of Tongue Behavior from MRI

Muscle coordination patterns of lingual behaviors are synergies generate...

Please sign up or login with your details

Forgot password? Click here to reset