High Dimensional Low Rank plus Sparse Matrix Decomposition

02/01/2015
by   Mostafa Rahmani, et al.
0

This paper is concerned with the problem of low rank plus sparse matrix decomposition for big data. Conventional algorithms for matrix decomposition use the entire data to extract the low-rank and sparse components, and are based on optimization problems with complexity that scales with the dimension of the data, which limits their scalability. Furthermore, existing randomized approaches mostly rely on uniform random sampling, which is quite inefficient for many real world data matrices that exhibit additional structures (e.g. clustering). In this paper, a scalable subspace-pursuit approach that transforms the decomposition problem to a subspace learning problem is proposed. The decomposition is carried out using a small data sketch formed from sampled columns/rows. Even when the data is sampled uniformly at random, it is shown that the sufficient number of sampled columns/rows is roughly O(rμ), where μ is the coherency parameter and r the rank of the low rank component. In addition, adaptive sampling algorithms are proposed to address the problem of column/row sampling from structured data. We provide an analysis of the proposed method with adaptive sampling and show that adaptive sampling makes the required number of sampled columns/rows invariant to the distribution of the data. The proposed approach is amenable to online implementation and an online scheme is proposed.

READ FULL TEXT

page 9

page 10

research
01/08/2020

Stability of Sampling for CUR Decompositions

This article studies how to form CUR decompositions of low-rank matrices...
research
09/02/2013

Unmixing Incoherent Structures of Big Data by Randomized or Greedy Decomposition

Learning big data by matrix decomposition always suffers from expensive ...
research
11/18/2016

Robust and Scalable Column/Row Sampling from Corrupted Big Data

Conventional sampling techniques fall short of drawing descriptive sketc...
research
05/09/2017

Spatial Random Sampling: A Structure-Preserving Data Sketching Tool

Random column sampling is not guaranteed to yield data sketches that pre...
research
08/27/2018

Identifiability of Low-Rank Sparse Component Analysis

Sparse component analysis (SCA) is the following problem: Given an input...
research
06/27/2019

High-Dimensional Optimization in Adaptive Random Subspaces

We propose a new randomized optimization method for high-dimensional pro...
research
03/09/2023

Fitting Low-rank Models on Egocentrically Sampled Partial Networks

The statistical modeling of random networks has been widely used to unco...

Please sign up or login with your details

Forgot password? Click here to reset