Transfer Learning by Modeling a Distribution over Policies

by Disha Shrivastava, et al.

Exploration and adaptation to new tasks in a transfer learning setup are central challenges in reinforcement learning. In this work, we build on the idea of modeling a distribution over policies in a Bayesian deep reinforcement learning setup to propose a transfer strategy. Recent work has shown that maximizing the entropy of a distribution over policies induces diversity in the learned policies (Bachman et al., 2018; Garnelo et al., 2018); we therefore postulate that our proposed approach leads to faster exploration and, in turn, improved transfer learning. We support this hypothesis with favorable experimental results across a variety of settings in fully observable GridWorld and partially observable MiniGrid (Chevalier-Boisvert et al., 2018) environments.

