Online Adaptation of Deep Architectures with Reinforcement Learning

08/08/2016
by   Thushan Ganegedara, et al.
0

Online learning has become crucial to many problems in machine learning. As more data is collected sequentially, quickly adapting to changes in the data distribution can offer several competitive advantages such as avoiding loss of prior knowledge and more efficient learning. However, adaptation to changes in the data distribution (also known as covariate shift) needs to be performed without compromising past knowledge already built in into the model to cope with voluminous and dynamic data. In this paper, we propose an online stacked Denoising Autoencoder whose structure is adapted through reinforcement learning. Our algorithm forces the network to exploit and explore favourable architectures employing an estimated utility function that maximises the accuracy of an unseen validation sequence. Different actions, such as Pool, Increment and Merge are available to modify the structure of the network. As we observe through a series of experiments, our approach is more responsive, robust, and principled than its counterparts for non-stationary as well as stationary data distributions. Experimental results indicate that our algorithm performs better at preserving gained prior knowledge and responding to changes in the data distribution.

READ FULL TEXT
research
12/18/2018

Deep Online Learning via Meta-Learning: Continual Adaptation for Model-Based RL

Humans and animals can learn complex predictive models that allow them t...
research
10/16/2020

Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data

Large scale deep learning provides a tremendous opportunity to improve t...
research
07/22/2019

Feature-Model-Guided Online Learning for Self-Adaptive Systems

A self-adaptive system can modify its own structure and behavior at runt...
research
02/10/2021

Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach

We propose a black-box reduction that turns a certain reinforcement lear...
research
06/20/2023

Adversarial Search and Track with Multiagent Reinforcement Learning in Sparsely Observable Environment

We study a search and tracking (S T) problem for a team of dynamic sea...
research
11/23/2021

Understanding the Impact of Data Distribution on Q-learning with Function Approximation

In this work, we focus our attention on the study of the interplay betwe...
research
11/07/2021

Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online Adaptation

We consider a new problem of adapting a human mesh reconstruction model ...

Please sign up or login with your details

Forgot password? Click here to reset