Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Uncertainty

10/12/2021
by   Jeffrey Ryan Willette, et al.
0

Numerous recent works utilize bi-Lipschitz regularization of neural network layers to preserve relative distances between data instances in the feature spaces of each layer. This distance sensitivity with respect to the data aids in tasks such as uncertainty calibration and out-of-distribution (OOD) detection. In previous works, features extracted with a distance sensitive model are used to construct feature covariance matrices which are used in deterministic uncertainty estimation or OOD detection. However, in cases where there is a distribution over tasks, these methods result in covariances which are sub-optimal, as they may not leverage all of the meta information which can be shared among tasks. With the use of an attentive set encoder, we propose to meta learn either diagonal or diagonal plus low-rank factors to efficiently construct task specific covariance matrices. Additionally, we propose an inference procedure which utilizes scaled energy to achieve a final predictive distribution which can better separate OOD data, and is well calibrated under a distributional dataset shift.

READ FULL TEXT

page 2

page 16

page 17

page 19

page 20

page 21

page 22

page 23

research
11/11/2018

SLANG: Fast Structured Covariance Approximations for Bayesian Deep Learning with Natural Gradient

Uncertainty estimation in large deep-learning models is a computationall...
research
10/07/2022

Private and Efficient Meta-Learning with Low Rank and Sparse Decomposition

Meta-learning is critical for a variety of practical ML systems – like p...
research
10/04/2022

Uncertainty-Aware Meta-Learning for Multimodal Task Distributions

Meta-learning or learning to learn is a popular approach for learning ne...
research
08/02/2021

Learning to Learn to Demodulate with Uncertainty Quantification via Bayesian Meta-Learning

Meta-learning, or learning to learn, offers a principled framework for f...
research
04/25/2020

Low-rank multi-parametric covariance identification

We propose a differential geometric construction for families of low-ran...
research
06/18/2012

Adaptive Regularization for Weight Matrices

Algorithms for learning distributions over weight-vectors, such as AROW ...

Please sign up or login with your details

Forgot password? Click here to reset