Component Transfer Learning for Deep RL Based on Abstract Representations

11/22/2021
by   Geoffrey van Driessel, et al.

In this work we investigate a specific transfer learning approach for deep reinforcement learning in the setting where two tasks share the same internal dynamics but differ in their visual representations. We learn a low-dimensional encoding of the environment, meant to capture summarizing abstractions, from which the internal dynamics and value functions are learned. Transfer is then obtained by freezing the learned internal dynamics and value functions, thus reusing the shared low-dimensional embedding space. When retraining the encoder for transfer, we make two observations: (i) in some cases there are local minima that have small losses but a mismatched embedding space, resulting in poor task performance; and (ii) in the absence of local minima, the output of the encoder converges in our experiments to the same embedding space, which leads to fast and efficient transfer compared to learning from scratch. The local minima arise because the frozen models reduce the degrees of freedom of the optimization process. We also find that transfer performance relies heavily on the base model: some base models often result in successful transfer, whereas others often result in failed transfer.
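
The transfer step described above can be illustrated with a deliberately tiny sketch. It is not the authors' implementation: it assumes a scalar abstract state, a linear encoder, and hypothetical frozen models `A`, `B`, `C` standing in for the learned internal dynamics and value (here, reward) function. The helpers `encode`, `make_transitions`, `retrain_encoder`, and `render_target` are names invented for this illustration. Only the encoder weights are updated; the frozen models define the embedding space the new encoder has to match.

```python
import random

# Hypothetical frozen abstract models, assumed to have been learned on the
# source task: z_next = A * z + B * action, reward = C * z.
A, B, C = 0.9, 0.5, 1.0


def encode(w, obs):
    """Linear encoder: maps an observation vector to a scalar abstract state."""
    return sum(wi * oi for wi, oi in zip(w, obs))


def make_transitions(render, n=200, seed=0):
    """Sample (obs, action, reward, next_obs) tuples from a toy environment.

    `render` maps the hidden state to an observation; it plays the role of
    the task-specific visual representation.
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        s = rng.uniform(-1.0, 1.0)
        u = rng.choice([-1.0, 1.0])
        s_next = A * s + B * u
        data.append((render(s), u, C * s, render(s_next)))
    return data


def retrain_encoder(transitions, steps=500, lr=0.05, seed=1):
    """Fit encoder weights by gradient descent while A, B, C stay frozen."""
    rng = random.Random(seed)
    dim = len(transitions[0][0])
    w = [rng.uniform(-0.1, 0.1) for _ in range(dim)]
    for _ in range(steps):
        grad = [0.0] * dim
        for obs, u, reward, next_obs in transitions:
            z, z_next = encode(w, obs), encode(w, next_obs)
            dyn_err = z_next - (A * z + B * u)  # frozen dynamics loss term
            rew_err = C * z - reward            # frozen reward loss term
            for i in range(dim):
                grad[i] += 2.0 * dyn_err * (next_obs[i] - A * obs[i])
                grad[i] += 2.0 * rew_err * C * obs[i]
        w = [wi - lr * g / len(transitions) for wi, g in zip(w, grad)]
    return w


def render_target(s):
    """Hypothetical target-task observation of the hidden state s."""
    return (2.0 * s, -1.0 * s)


w_target = retrain_encoder(make_transitions(render_target))
print(encode(w_target, render_target(0.3)))
```

Because everything in this sketch is linear, the loss is convex and the retrained encoder recovers the embedding expected by the frozen models. The local minima discussed in the abstract concern nonlinear deep encoders and are not reproduced here.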

research
09/12/2018

Combined Reinforcement Learning via Abstract Representations

In the quest for efficient and robust reinforcement learning methods, bo...
research
08/14/2021

Fractional Transfer Learning for Deep Model-Based Reinforcement Learning

Reinforcement learning (RL) is well known for requiring large amounts of...
research
06/26/2019

No Pressure! Addressing the Problem of Local Minima in Manifold Learning Algorithms

Nonlinear embedding manifold learning methods provide invaluable visual ...
research
10/26/2018

Transfer of Deep Reactive Policies for MDP Planning

Domain-independent probabilistic planners input an MDP description in a ...
research
07/12/2020

Learning Abstract Models for Strategic Exploration and Fast Reward Transfer

Model-based reinforcement learning (RL) is appealing because (i) it enab...
research
07/10/2017

Improving speaker turn embedding by crossmodal transfer learning from face embedding

Learning speaker turn embeddings has shown considerable improvement in s...
research
03/14/2012

Evolving Culture vs Local Minima

We propose a theory that relates difficulty of learning in deep architec...
