Generalized Bayesian Posterior Expectation Distillation for Deep Neural Networks

05/16/2020
by   Meet P. Vadera, et al.
0

In this paper, we present a general framework for distilling expectations with respect to the Bayesian posterior distribution of a deep neural network classifier, extending prior work on the Bayesian Dark Knowledge framework. The proposed framework takes as input "teacher" and student model architectures and a general posterior expectation of interest. The distillation method performs an online compression of the selected posterior expectation using iteratively generated Monte Carlo samples. We focus on the posterior predictive distribution and expected entropy as distillation targets. We investigate several aspects of this framework including the impact of uncertainty and the choice of student model architecture. We study methods for student model architecture search from a speed-storage-accuracy perspective and evaluate down-stream tasks leveraging entropy distillation including uncertainty ranking and out-of-distribution detection.

READ FULL TEXT

page 11

page 13

page 20

research
06/04/2019

Assessing the Robustness of Bayesian Dark Knowledge to Posterior Uncertainty

Bayesian Dark Knowledge is a method for compressing the posterior predic...
research
06/27/2018

Adversarial Distillation of Bayesian Neural Network Posteriors

Bayesian neural networks (BNNs) allow us to reason about uncertainty in ...
research
03/28/2023

DisWOT: Student Architecture Search for Distillation WithOut Training

Knowledge distillation (KD) is an effective training strategy to improve...
research
06/14/2015

Bayesian Dark Knowledge

We consider the problem of Bayesian parameter estimation for deep neural...
research
06/27/2022

Revisiting Architecture-aware Knowledge Distillation: Smaller Models and Faster Search

Knowledge Distillation (KD) has recently emerged as a popular method for...
research
09/04/2023

Efficient computation of predictive probabilities in probit models via expectation propagation

Binary regression models represent a popular model-based approach for bi...
research
03/29/2021

Rapid Risk Minimization with Bayesian Models Through Deep Learning Approximation

In this paper, we introduce a novel combination of Bayesian Models (BMs)...

Please sign up or login with your details

Forgot password? Click here to reset