Efficient Statistics for Sparse Graphical Models from Truncated Samples

06/17/2020
by   Arnab Bhattacharyya, et al.
0

In this paper, we study high-dimensional estimation from truncated samples. We focus on two fundamental and classical problems: (i) inference of sparse Gaussian graphical models and (ii) support recovery of sparse linear models. (i) For Gaussian graphical models, suppose d-dimensional samples x are generated from a Gaussian N(μ,Σ) and observed only if they belong to a subset S ⊆ℝ^d. We show that μ and Σ can be estimated with error ϵ in the Frobenius norm, using Õ(nz(Σ^-1)/ϵ^2) samples from a truncated 𝒩(μ,Σ) and having access to a membership oracle for S. The set S is assumed to have non-trivial measure under the unknown distribution but is otherwise arbitrary. (ii) For sparse linear regression, suppose samples ( x,y) are generated where y = x^⊤Ω^* + 𝒩(0,1) and ( x, y) is seen only if y belongs to a truncation set S ⊆ℝ. We consider the case that Ω^* is sparse with a support set of size k. Our main result is to establish precise conditions on the problem dimension d, the support size k, the number of observations n, and properties of the samples and the truncation that are sufficient to recover the support of Ω^*. Specifically, we show that under some mild assumptions, only O(k^2 log d) samples are needed to estimate Ω^* in the ℓ_∞-norm up to a bounded error. For both problems, our estimator minimizes the sum of the finite population negative log-likelihood function and an ℓ_1-regularization term.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/11/2018

Efficient Statistics, in High Dimensions, from Truncated Samples

We provide an efficient algorithm for the classical problem, going back ...
research
06/22/2020

Support Union Recovery in Meta Learning of Gaussian Graphical Models

In this paper we study Meta learning of Gaussian graphical models. In ou...
research
02/12/2014

Sparse Estimation From Noisy Observations of an Overdetermined Linear System

This note studies a method for the efficient estimation of a finite numb...
research
08/02/2019

Efficient Truncated Statistics with Unknown Truncation

We study the problem of estimating the parameters of a Gaussian distribu...
research
12/02/2021

Optimal regularizations for data generation with probabilistic graphical models

Understanding the role of regularization is a central question in Statis...
research
05/03/2019

Learning Some Popular Gaussian Graphical Models without Condition Number Bounds

Gaussian Graphical Models (GGMs) have wide-ranging applications in machi...
research
03/29/2023

Module-based regularization improves Gaussian graphical models when observing noisy data

Researchers often represent relations in multi-variate correlational dat...

Please sign up or login with your details

Forgot password? Click here to reset