Stochastic Zeroth Order Gradient and Hessian Estimators: Variance Reduction and Refined Bias Bounds

05/29/2022
by   Yasong Feng, et al.

We study stochastic zeroth order gradient and Hessian estimators for real-valued functions in ℝ^n. We show that, by taking finite differences along random orthogonal directions, the variance of the stochastic finite difference estimators can be significantly reduced. In particular, we design estimators for smooth functions such that, if one uses Θ( k ) random directions sampled from the Stiefel manifold St (n,k) and finite-difference granularity δ, the variance of the gradient estimator is bounded by 𝒪( ( n/k - 1 ) + ( n^2/k - n ) δ^2 + n^2 δ^4 / k ), and the variance of the Hessian estimator is bounded by 𝒪( ( n^2/k^2 - 1 ) + ( n^4/k^2 - n^2 ) δ^2 + n^4 δ^4 /k^2). When k = n, the leading terms ( n/k - 1 ) and ( n^2/k^2 - 1 ) vanish, so the variances become negligibly small for small δ. In addition, we provide improved bias bounds for the estimators. The bias of both the gradient and the Hessian estimators for a smooth function f is of order 𝒪( δ^2 Γ), where Γ depends on high order derivatives of f. Our results are supported by empirical observations.
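To make the idea of finite differences along random orthogonal directions concrete, here is a minimal Python sketch of a gradient estimator of this kind. It is an illustration under stated assumptions, not the authors' implementation. The abstract does not give the exact form of the estimator, so the central-difference form and the n/k scaling are assumptions, and the function names `sample_stiefel` and `zo_gradient` are hypothetical.

```python
import numpy as np

def sample_stiefel(n, k, rng):
    """Draw an n-by-k matrix with orthonormal columns (a point on St(n, k)).

    Uses the QR factorization of a Gaussian matrix; the sign correction
    makes the factorization unique so the draw is uniformly distributed.
    """
    A = rng.standard_normal((n, k))
    Q, R = np.linalg.qr(A)              # reduced QR: Q is n-by-k
    return Q * np.sign(np.diag(R))      # flip column signs so diag(R) > 0

def zo_gradient(f, x, k, delta, rng):
    """Zeroth order gradient estimate from k orthonormal random directions.

    Assumed form (not taken from the abstract): central differences along
    each direction v_i, scaled by n/k so that E[(n/k) V V^T] = I.
    """
    n = x.shape[0]
    V = sample_stiefel(n, k, rng)
    g = np.zeros(n)
    for i in range(k):
        v = V[:, i]
        # Central finite difference approximates the directional derivative.
        slope = (f(x + delta * v) - f(x - delta * v)) / (2.0 * delta)
        g += slope * v
    return (n / k) * g
```

With k = n the directions form a full orthonormal basis, so for a quadratic f (where central differences are exact) the estimate matches the true gradient up to rounding error. This is consistent with the claim that the variance becomes negligible at k = n.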


research
01/26/2022

Towards Sharp Stochastic Zeroth Order Hessian Estimators over Riemannian Manifolds

We study Hessian estimators for real-valued functions defined over an n-...
research
01/31/2019

Stochastic Recursive Variance-Reduced Cubic Regularization Methods

Stochastic Variance-Reduced Cubic regularization (SVRC) algorithms have ...
research
12/20/2022

Generalized Simultaneous Perturbation Stochastic Approximation with Reduced Estimator Bias

We present in this paper a family of generalized simultaneous perturbati...
research
10/10/2018

Rao-Blackwellized Stochastic Gradients for Discrete Distributions

We wish to compute the gradient of an expectation over a finite or count...
research
06/19/2019

Variances of surface area estimators based on pixel configuration counts

The surface area of a set which is only observed as a binary pixel image...
research
11/30/2021

Martingale product estimators for sensitivity analysis in computational statistical physics

We introduce a new class of estimators for the linear response of steady...
research
09/23/2019

Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Estimators for Reinforcement Learning

Gradient-based methods for optimisation of objectives in stochastic sett...
