Polynomial Time and Sample Complexity for Non-Gaussian Component Analysis: Spectral Methods

04/04/2017
by   Yan Shuo Tan, et al.
0

The problem of Non-Gaussian Component Analysis (NGCA) is about finding a maximal low-dimensional subspace E in R^n so that data points projected onto E follow a non-gaussian distribution. Although this is an appropriate model for some real world data analysis problems, there has been little progress on this problem over the last decade. In this paper, we attempt to address this state of affairs in two ways. First, we give a new characterization of standard gaussian distributions in high-dimensions, which lead to effective tests for non-gaussianness. Second, we propose a simple algorithm, Reweighted PCA, as a method for solving the NGCA problem. We prove that for a general unknown non-gaussian distribution, this algorithm recovers at least one direction in E, with sample and time complexity depending polynomially on the dimension of the ambient space. We conjecture that the algorithm actually recovers the entire E.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/01/2011

Sparse Non Gaussian Component Analysis by Semidefinite Programming

Sparse non-Gaussian component analysis (SNGCA) is an unsupervised method...
research
12/16/2021

Non-Gaussian Component Analysis via Lattice Basis Reduction

Non-Gaussian Component Analysis (NGCA) is the following distribution lea...
research
01/28/2016

Non-Gaussian Component Analysis with Log-Density Gradient Estimation

Non-Gaussian component analysis (NGCA) is aimed at identifying a linear ...
research
07/13/2018

Non-Gaussian Component Analysis using Entropy Methods

Non-Gaussian component analysis (NGCA) is a problem in multidimensional ...
research
03/15/2013

Subspace Clustering via Thresholding and Spectral Clustering

We consider the problem of clustering a set of high-dimensional data poi...
research
06/12/2020

Robust Sub-Gaussian Principal Component Analysis and Width-Independent Schatten Packing

We develop two methods for the following fundamental statistical task: g...
research
04/28/2020

Learning Polynomials of Few Relevant Dimensions

Polynomial regression is a basic primitive in learning and statistics. I...

Please sign up or login with your details

Forgot password? Click here to reset