An Efficient Algorithm for Non-Negative Matrix Factorization with Random Projections

12/06/2017
by   Gabriele Torre, et al.
0

Non-negative matrix factorization (NMF) is one of the most popular decomposition techniques for multivariate data. NMF is a core method for many machine-learning related computational problems, such as data compression, feature extraction, word embedding, recommender systems etc. In practice, however, its application is challenging for large datasets. The efficiency of NMF is constrained by long data loading times, by large memory requirements and by limited parallelization capabilities. Here we present a novel and efficient compressed NMF algorithm. Our algorithm applies a random compression scheme to drastically reduce the dimensionality of the problem, preserving well the pairwise distances between data points and inherently limiting the memory and communication load. Our algorithm supersedes existing methods in speed. Nonetheless, it matches the best non-compressed algorithms in reconstruction precision.

READ FULL TEXT

page 9

page 11

page 15

research
09/08/2021

Initialization for Nonnegative Matrix Factorization: a Comprehensive Review

Non-negative matrix factorization (NMF) has become a popular method for ...
research
02/05/2018

Robust Vertex Enumeration for Convex Hulls in High Dimensions

Computation of the vertices of the convex hull of a set S of n points in...
research
03/05/2018

Relative Pairwise Relationship Constrained Non-negative Matrix Factorisation

Non-negative Matrix Factorisation (NMF) has been extensively used in mac...
research
04/16/2019

PL-NMF: Parallel Locality-Optimized Non-negative Matrix Factorization

Non-negative Matrix Factorization (NMF) is a key kernel for unsupervised...
research
11/10/2020

Gaussian Compression Stream: Principle and Preliminary Results

Random projections became popular tools to process big data. In particul...
research
06/25/2017

There and Back Again: A General Approach to Learning Sparse Models

We propose a simple and efficient approach to learning sparse models. Ou...
research
01/23/2019

PD-ML-Lite: Private Distributed Machine Learning from Lighweight Cryptography

Privacy is a major issue in learning from distributed data. Recently the...

Please sign up or login with your details

Forgot password? Click here to reset