Covariance Matrix Estimation with Non Uniform and Data Dependent Missing Observations

10/01/2019
by   Eduardo Pavez, et al.
0

In this paper we study covariance estimation with missing data. We consider various types of missing data mechanisms, which may depend or not on the observed data, and have a time varying distribution. Additionally, observed variables may have arbitrary (non uniform) and dependent observation probabilities. For each missing data mechanism we use a different estimator, and we obtain bounds for the expected value of the estimation error in operator norm. Our bounds are equivalent (up to constant and logarithmic factors), to state of the art bounds for complete and uniform missing observations. Furthermore, for the more general non uniform and dependent case, the proposed bounds are new or improve upon previous results. Our bounds depend on quantities we call scaled effective rank, which generalize the effective rank to account for missing observations. We show that all the estimators studied in this work have the same asymptotic convergence rate (up to logarithmic factors).

READ FULL TEXT
research
11/01/2018

HMLasso: Lasso for High Dimensional and Highly Missing Data

Sparse regression such as Lasso has achieved great success in dealing wi...
research
07/05/2022

Handling Nonmonotone Missing Data with Available Complete-Case Missing Value Assumption

Nonmonotone missing data is a common problem in scientific studies. The ...
research
09/09/2022

Deep Learning with Non-Linear Factor Models: Adaptability and Avoidance of Curse of Dimensionality

In this paper, we connect deep learning literature with non-linear facto...
research
06/10/2022

Provable Guarantees for Sparsity Recovery with Deterministic Missing Data Patterns

We study the problem of consistently recovering the sparsity pattern of ...
research
06/08/2020

Estimating High-dimensional Covariance and Precision Matrices under General Missing Dependence

A sample covariance matrix S of completely observed data is the key stat...
research
11/11/2018

Fast Matrix Factorization with Non-Uniform Weights on Missing Data

Matrix factorization (MF) has been widely used to discover the low-rank ...
research
06/19/2019

First order covariance inequalities via Stein's method

We propose probabilistic representations for inverse Stein operators (i....

Please sign up or login with your details

Forgot password? Click here to reset