GO Hessian for Expectation-Based Objectives

06/16/2020
by   Yulai Cong, et al.
1

An unbiased low-variance gradient estimator, termed GO gradient, was proposed recently for expectation-based objectives E_q_γ(y) [f(y)], where the random variable (RV) y may be drawn from a stochastic computation graph with continuous (non-reparameterizable) internal nodes and continuous/discrete leaves. Upgrading the GO gradient, we present for E_q_γ(y) [f(y)] an unbiased low-variance Hessian estimator, named GO Hessian. Considering practical implementation, we reveal that GO Hessian is easy-to-use with auto-differentiation and Hessian-vector products, enabling efficient cheap exploitation of curvature information over stochastic computation graphs. As representative examples, we present the GO Hessian for non-reparameterizable gamma and negative binomial RVs/nodes. Based on the GO Hessian, we design a new second-order method for E_q_γ(y) [f(y)], with rigorous experiments conducted to verify its effectiveness and efficiency.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/22/2023

Convergence of Hessian estimator from random samples on a manifold

We provide a systematic convergence analysis of the Hessian operator est...
research
09/06/2021

Unbiased Estimation of the Hessian for Partially Observed Diffusions

In this article we consider the development of unbiased estimators of th...
research
07/29/2020

A new framework for the computation of Hessians

We investigate the computation of Hessian matrices via Automatic Differe...
research
09/09/2015

Fast Second-Order Stochastic Backpropagation for Variational Inference

We propose a second-order (Hessian or Hessian-free) based optimization m...
research
06/27/2012

Estimating the Hessian by Back-propagating Curvature

In this work we develop Curvature Propagation (CP), a general technique ...
research
12/20/2020

Discrete Hessian complexes in three dimensions

One conforming and one non-conforming virtual element Hessian complexes ...
research
10/20/2017

Tracking the gradients using the Hessian: A new look at variance reducing stochastic methods

Our goal is to improve variance reducing stochastic methods through bett...

Please sign up or login with your details

Forgot password? Click here to reset