Detecting the large entries of a sparse covariance matrix in sub-quadratic time

05/12/2015
by Ofer Shwartz, et al.

The covariance matrix of a p-dimensional random variable is a fundamental quantity in data analysis. Given n i.i.d. observations, it is typically estimated by the sample covariance matrix, at a computational cost of O(np^2) operations. When n and p are large, this computation may be prohibitively slow. Moreover, in several contemporary applications, the population matrix is approximately sparse, and only its few large entries are of interest. This raises the following question, which is the focus of our work: assuming approximate sparsity of the covariance matrix, can its large entries be detected much faster, say in sub-quadratic time, without explicitly computing all p^2 of its entries? In this paper, we present and theoretically analyze two randomized algorithms that detect the large entries of an approximately sparse sample covariance matrix using only O(np poly log p) operations. Furthermore, assuming sparsity of the population matrix, we derive sufficient conditions on the underlying random variable and on the number of samples n for the sample covariance matrix to satisfy our approximate sparsity requirements. Finally, we illustrate the performance of our algorithms via several simulations.
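
For orientation, the quadratic-time baseline that the abstract argues against can be sketched as follows. This is only an illustration of the O(np^2) approach (form the full sample covariance matrix, then keep entries above a threshold); it is not either of the paper's two randomized algorithms, which the abstract does not describe in detail. The function name `large_entries_naive` and the threshold parameter `tau` are assumptions introduced here for the example.

```python
import numpy as np

def large_entries_naive(X, tau):
    """Baseline sketch: list off-diagonal sample covariance entries with |value| >= tau.

    X   : (n, p) array holding n observations of a p-dimensional variable.
    tau : hypothetical magnitude threshold defining a "large" entry.
    Returns a list of (i, j, value) tuples with i < j.
    """
    n, p = X.shape
    Xc = X - X.mean(axis=0)          # center each of the p coordinates
    S = (Xc.T @ Xc) / (n - 1)        # full p x p sample covariance: O(n p^2) work
    iu, ju = np.triu_indices(p, k=1) # index pairs strictly above the diagonal
    mask = np.abs(S[iu, ju]) >= tau  # scan all p(p-1)/2 candidate entries
    return [(int(i), int(j), float(S[i, j])) for i, j in zip(iu[mask], ju[mask])]
```

Both the matrix product and the scan over all pairs touch every one of the roughly p^2 entries, which is the cost a sub-quadratic method would need to avoid.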

research
04/04/2018

Active covariance estimation by random sub-sampling of variables

We study covariance matrix estimation for the case of partially observed...
research
12/10/2020

Bayesian nonstationary and nonparametric covariance estimation for large spatial data

In spatial statistics, it is often assumed that the spatial field of int...
research
11/01/2022

Fluctuations of the diagonal entries of a large sample precision matrix

For a given p × n data matrix X_n with i.i.d. centered entries and a popu...
research
05/23/2016

Sub-Gaussian estimators of the mean of a random matrix with heavy-tailed entries

Estimation of the covariance matrix has attracted a lot of attention of ...
research
06/01/2016

Graph-Guided Banding of the Covariance Matrix

Regularization has become a primary tool for developing reliable estimat...
research
11/22/2017

Sparsity-based Cholesky Factorization and its Application to Hyperspectral Anomaly Detection

Estimating large covariance matrices has been a longstanding important p...
research
05/01/2018

Multi-Step Knowledge-Aided Iterative ESPRIT for Direction Finding

In this work, we propose a subspace-based algorithm for DOA estimation w...
