The Hierarchical Adaptive Forgetting Variational Filter

05/15/2018
by   Vincent Moens, et al.
0

A common problem in Machine Learning and statistics consists in detecting whether the current sample in a stream of data belongs to the same distribution as previous ones, is an isolated outlier or inaugurates a new distribution of data. We present a hierarchical Bayesian algorithm that aims at learning a time-specific approximate posterior distribution of the parameters describing the distribution of the data observed. We derive the update equations of the variational parameters of the approximate posterior at each time step for models from the exponential family, and show that these updates find interesting correspondents in Reinforcement Learning (RL). In this perspective, our model can be seen as a hierarchical RL algorithm that learns a posterior distribution according to a certain stability confidence that is, in turn, learned according to its own stability confidence. Finally, we show some applications of our generic model, first in a RL context, next with an adaptive Bayesian Autoregressive model, and finally in the context of Stochastic Gradient Descent optimization.

READ FULL TEXT
research
06/28/2012

Fixed-Form Variational Posterior Approximation through Stochastic Linear Regression

We propose a general algorithm for approximating nonstandard Bayesian po...
research
10/20/2022

Model-based Lifelong Reinforcement Learning with Bayesian Exploration

We propose a model-based lifelong reinforcement-learning approach that e...
research
05/29/2023

Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo

We present a scalable and effective exploration strategy based on Thomps...
research
03/02/2023

Variational EP with Probabilistic Backpropagation for Bayesian Neural Networks

I propose a novel approach for nonlinear Logistic regression using a two...
research
09/01/2022

Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization

A key challenge of continual reinforcement learning (CRL) in dynamic env...
research
05/30/2018

Context Exploitation using Hierarchical Bayesian Models

We consider the problem of how to improve automatic target recognition b...
research
12/20/2022

Variational Factorization Machines for Preference Elicitation in Large-Scale Recommender Systems

Factorization machines (FMs) are a powerful tool for regression and clas...

Please sign up or login with your details

Forgot password? Click here to reset