Analyzing statistical and computational tradeoffs of estimation procedures

06/25/2015
by   Daniel L. Sussman, et al.
0

The recent explosion in the amount and dimensionality of data has exacerbated the need of trading off computational and statistical efficiency carefully, so that inference is both tractable and meaningful. We propose a framework that provides an explicit opportunity for practitioners to specify how much statistical risk they are willing to accept for a given computational cost, and leads to a theoretical risk-computation frontier for any given inference problem. We illustrate the tradeoff between risk and computation and illustrate the frontier in three distinct settings. First, we derive analytic forms for the risk of estimating parameters in the classical setting of estimating the mean and variance for normally distributed data and for the more general setting of parameters of an exponential family. The second example concentrates on computationally constrained Hodges-Lehmann estimators. We conclude with an evaluation of risk associated with early termination of iterative matrix inversion algorithms in the context of linear regression.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/19/2018

Robust Estimation via Robust Gradient Estimation

We provide a new computationally-efficient class of estimators for risk ...
research
12/11/2007

PAC-Bayesian Bounds for Randomized Empirical Risk Minimizers

The aim of this paper is to generalize the PAC-Bayesian theorems proved ...
research
12/22/2020

MLE of Jointly Constrained Mean-Covariance of Multivariate Normal Distributions

Estimating the unconstrained mean and covariance matrix is a popular top...
research
10/20/2020

Unbiased estimation and backtesting of risk in the context of heavy tails

While the estimation of risk is an important question in the daily busin...
research
02/14/2019

Dualizing Le Cam's method, with applications to estimating the unseens

One of the most commonly used techniques for proving statistical lower b...
research
09/30/2018

Distributed linear regression by averaging

Modern massive datasets pose an enormous computational burden to practit...

Please sign up or login with your details

Forgot password? Click here to reset