Dealing with Range Anxiety in Mean Estimation via Statistical Queries

11/20/2016
by   Vitaly Feldman, et al.
0

We give algorithms for estimating the expectation of a given real-valued function ϕ:X→ R on a sample drawn randomly from some unknown distribution D over domain X, namely E_ x∼ D[ϕ( x)]. Our algorithms work in two well-studied models of restricted access to data samples. The first one is the statistical query (SQ) model in which an algorithm has access to an SQ oracle for the input distribution D over X instead of i.i.d. samples from D. Given a query function ϕ:X → [0,1], the oracle returns an estimate of E_ x∼ D[ϕ( x)] within some tolerance τ. The second, is a model in which only a single bit is communicated from each sample. In both of these models the error obtained using a naive implementation would scale polynomially with the range of the random variable ϕ( x) (which might even be infinite). In contrast, without restrictions on access to data the expected error scales with the standard deviation of ϕ( x). Here we give a simple algorithm whose error scales linearly in standard deviation of ϕ( x) and logarithmically with an upper bound on the second moment of ϕ( x). As corollaries, we obtain algorithms for high dimensional mean estimation and stochastic convex optimization in these models that work in more general settings than previously known solutions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/07/2016

A General Characterization of the Statistical Query Complexity

Statistical query (SQ) algorithms are algorithms that have access to an ...
research
08/15/2019

Robust estimation of the mean with bounded relative standard deviation

Many randomized approximation algorithms operate by giving a procedure f...
research
02/07/2019

On Mean Estimation for General Norms with Statistical Queries

We study the problem of mean estimation for high-dimensional distributio...
research
06/02/2022

A Scalable Shannon Entropy Estimator

We revisit the well-studied problem of estimating the Shannon entropy of...
research
03/21/2020

Black-box Methods for Restoring Monotonicity

In many practical applications, heuristic or approximation algorithms ar...
research
05/11/2022

Query Efficient Prophet Inequality with Unknown I.I.D. Distributions

We study the single-choice prophet inequality problem, where a gambler f...
research
02/03/2021

Query Complexity of Least Absolute Deviation Regression via Robust Uniform Convergence

Consider a regression problem where the learner is given a large collect...

Please sign up or login with your details

Forgot password? Click here to reset