Group Invariance and Computational Sufficiency

07/16/2018
by   Vincent Q. Vu, et al.
0

Statistical sufficiency formalizes the notion of data reduction. In the decision theoretic interpretation, once a model is chosen all inferences should be based on a sufficient statistic. However, suppose we start with a set of procedures rather than a specific model. Is it possible to reduce the data and yet still be able to compute all of the procedures? In other words, what functions of the data contain all of the information sufficient for computing these procedures? This article presents some progress towards a theory of "computational sufficiency" and shows that strong reductions can be made for large classes of penalized M-estimators by exploiting hidden symmetries in the underlying optimization problems. These reductions can (1) reveal hidden connections between seemingly disparate methods, (2) enable efficient computation, (3) give a different perspective on understanding procedures in a model-free setting. As a main example, the theory provides a surprising answer to the following question: "What do the Graphical Lasso, sparse PCA, single-linkage clustering, and L1 penalized Ising model selection all have in common?"

READ FULL TEXT

page 4

page 5

research
07/19/2013

The Cluster Graphical Lasso for improved estimation of Gaussian graphical models

We consider the task of estimating a Gaussian graphical model in the hig...
research
05/16/2020

Reducibility and Statistical-Computational Gaps from Secret Leakage

Inference problems with conjectured statistical-computational gaps are u...
research
10/07/2021

AgFlow: Fast Model Selection of Penalized PCA via Implicit Regularization Effects of Gradient Flow

Principal component analysis (PCA) has been widely used as an effective ...
research
04/20/2020

The Geometry of Uniqueness and Model Selection of Penalized Estimators including SLOPE, LASSO, and Basis Pursuit

We provide a necessary and sufficient condition for the uniqueness of pe...
research
10/30/2017

Solution of linear ill-posed problems by model selection and aggregation

We consider a general statistical linear inverse problem, where the solu...
research
02/27/2022

Strong Consistency for a Class of Adaptive Clustering Procedures

We introduce a class of clustering procedures which includes k-means and...
research
09/30/2013

On statistics, computation and scalability

How should statistical procedures be designed so as to be scalable compu...

Please sign up or login with your details

Forgot password? Click here to reset