A Fast Factorization-based Approach to Robust PCA

09/27/2016
by   Chong Peng, et al.
0

Robust principal component analysis (RPCA) has been widely used for recovering low-rank matrices in many data mining and machine learning problems. It separates a data matrix into a low-rank part and a sparse part. The convex approach has been well studied in the literature. However, state-of-the-art algorithms for the convex approach usually have relatively high complexity due to the need of solving (partial) singular value decompositions of large matrices. A non-convex approach, AltProj, has also been proposed with lighter complexity and better scalability. Given the true rank r of the underlying low rank matrix, AltProj has a complexity of O(r^2dn), where d× n is the size of data matrix. In this paper, we propose a novel factorization-based model of RPCA, which has a complexity of O(kdn), where k is an upper bound of the true rank. Our method does not need the precise value of the true rank. From extensive experiments, we observe that AltProj can work only when r is precisely known in advance; however, when the needed rank parameter r is specified to a value different from the true rank, AltProj cannot fully separate the two parts while our method succeeds. Even when both work, our method is about 4 times faster than AltProj. Our method can be used as a light-weight, scalable tool for RPCA in the absence of the precise value of the true rank.

READ FULL TEXT
research
04/16/2019

RES-PCA: A Scalable Approach to Recovering Low-rank Matrices

Robust principal component analysis (RPCA) has drawn significant attenti...
research
12/08/2017

Fast Low-Rank Matrix Estimation without the Condition Number

In this paper, we study the general problem of optimizing a convex funct...
research
12/12/2019

Fast non-convex low-rank matrix decomposition for separation of potential field data using minimal memory

A fast non-convex low-rank matrix decomposition method for potential fie...
research
02/25/2019

Logistic principal component analysis via non-convex singular value thresholding

Multivariate binary data is becoming abundant in current biological rese...
research
02/15/2018

Fast Generalized Conditional Gradient Method with Applications to Matrix Recovery Problems

Motivated by matrix recovery problems such as Robust Principal Component...
research
11/19/2018

Classical Algorithms from Quantum and Arthur-Merlin Communication Protocols

The polynomial method from circuit complexity has been applied to severa...
research
08/26/2011

Solving Principal Component Pursuit in Linear Time via l_1 Filtering

In the past decades, exactly recovering the intrinsic data structure fro...

Please sign up or login with your details

Forgot password? Click here to reset