Generalized Low Rank Models

10/01/2014
by Madeleine Udell, et al.
Principal components analysis (PCA) is a well-known technique for approximating a tabular data set by a low rank matrix. Here, we extend the idea of PCA to handle arbitrary data sets consisting of numerical, Boolean, categorical, ordinal, and other data types. This framework encompasses many well-known techniques in data analysis, such as nonnegative matrix factorization, matrix completion, sparse and robust PCA, k-means, k-SVD, and maximum margin matrix factorization. The method handles heterogeneous data sets and leads to coherent schemes for compressing, denoising, and imputing missing entries across all data types simultaneously. It also admits a number of interesting interpretations of the low rank factors, which allow clustering of examples or of features. We propose several parallel algorithms for fitting generalized low rank models, and describe implementations and numerical results.
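To make the idea of approximating a partially observed table by a low rank matrix concrete, here is a minimal sketch of one simple special case: quadratic loss on the observed entries with quadratic regularization on both factors, fitted by alternating least squares. It is not the authors' implementation. The names `fit_low_rank` and `objective`, the regularization weight `reg`, and the synthetic data are all assumptions chosen for illustration, and the sketch handles only numerical data, not the other data types the abstract mentions.

```python
import numpy as np

def objective(A, mask, X, Y, reg):
    """Squared error on observed entries plus quadratic regularization."""
    resid = (A - X @ Y)[mask]
    return float(resid @ resid + reg * (np.sum(X**2) + np.sum(Y**2)))

def fit_low_rank(A, mask, k, reg=0.1, iters=50, seed=0):
    """Alternating least squares for a rank-k model A ~ X @ Y.

    A    : (m, n) array; values where mask is False are ignored.
    mask : (m, n) boolean array, True where A is observed.
    k    : rank of the approximation (hypothetical parameter name).
    """
    rng = np.random.default_rng(seed)
    m, n = A.shape
    X = rng.standard_normal((m, k))
    Y = rng.standard_normal((k, n))
    ridge = reg * np.eye(k)
    for _ in range(iters):
        # With Y fixed, each row of X solves a small ridge regression
        # over the columns observed in that row.
        for i in range(m):
            obs = mask[i]
            Yo = Y[:, obs]
            X[i] = np.linalg.solve(Yo @ Yo.T + ridge, Yo @ A[i, obs])
        # With X fixed, each column of Y solves a ridge regression
        # over the rows observed in that column.
        for j in range(n):
            obs = mask[:, j]
            Xo = X[obs]
            Y[:, j] = np.linalg.solve(Xo.T @ Xo + ridge, Xo.T @ A[obs, j])
    return X, Y

if __name__ == "__main__":
    # Synthetic rank-2 table with roughly 30% of entries hidden.
    rng = np.random.default_rng(1)
    A = rng.standard_normal((30, 2)) @ rng.standard_normal((2, 20))
    mask = rng.random(A.shape) < 0.7
    X, Y = fit_low_rank(A, mask, k=2, reg=1e-3)
    imputed = np.where(mask, A, X @ Y)  # fill hidden entries from the model
    print("objective:", objective(A, mask, X, Y, 1e-3))
```

Each update minimizes the objective exactly over one factor while the other is held fixed, so the objective cannot increase from one pass to the next. The product `X @ Y` then serves as both a compressed representation of the table and a source of imputed values for the hidden entries.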

