Identifiability of Low-Rank Sparse Component Analysis

08/27/2018
by   Jérémy E. Cohen, et al.
0

Sparse component analysis (SCA) is the following problem: Given an input matrix M and an integer r, find a dictionary D with r columns and a sparse matrix B with r rows such that M ≈ DB. A key issue in SCA is identifiability, that is, characterizing the conditions under which D and B are essentially unique (that is, they are unique up to permutation and scaling of the columns of D and rows of B). Although SCA has been vastly investigated in the last two decades, only a few works have tackled this issue in the deterministic scenario, and no work provides reasonable bounds in the minimum number of data points (that is, columns of M) that leads to identifiability. In this work, we provide new results in the deterministic scenario when the data has a low-rank structure, that is, when D has rank r, drastically improving with respect to previous results. In particular, we show that if each column of B contains at least s zeros then O(r^3/s^2) data points are sufficient to obtain an essentially unique decomposition, as long as these data points are well spread among the subspaces spanned by r-1 columns of D. This implies for example that for a fixed proportion of zeros (constant and independent of r, e.g., 10% of zero entries in B), one only requires O(r) data points to guarantee identifiability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/07/2017

Low Rank Matrix Recovery with Simultaneous Presence of Outliers and Sparse Corruption

We study a data model in which the data matrix D can be expressed as D =...
research
09/26/2022

Bounded Simplex-Structured Matrix Factorization

In this paper, we propose a new low-rank matrix factorization model dubb...
research
02/01/2015

High Dimensional Low Rank plus Sparse Matrix Decomposition

This paper is concerned with the problem of low rank plus sparse matrix ...
research
04/26/2018

Tensor Methods for Nonlinear Matrix Completion

In the low rank matrix completion (LRMC) problem, the low rank assumptio...
research
11/24/2021

Dictionary-based Low-Rank Approximations and the Mixed Sparse Coding problem

Constrained tensor and matrix factorization models allow to extract inte...
research
02/21/2019

A Dictionary Based Generalization of Robust PCA

We analyze the decomposition of a data matrix, assumed to be a superposi...
research
03/25/2021

Biwhitening Reveals the Rank of a Count Matrix

Estimating the rank of a corrupted data matrix is an important task in d...

Please sign up or login with your details

Forgot password? Click here to reset