Cross-Domain Transfer in Reinforcement Learning using Target Apprentice

01/22/2018
by   Girish Joshi, et al.
0

In this paper, we present a new approach to Transfer Learning (TL) in Reinforcement Learning (RL) for cross-domain tasks. Many of the available techniques approach the transfer architecture as a method of speeding up the target task learning. We propose to adapt and reuse the mapped source task optimal-policy directly in related domains. We show the optimal policy from a related source task can be near optimal in target domain provided an adaptive policy accounts for the model error between target and source. The main benefit of this policy augmentation is generalizing policies across multiple related domains without having to re-learn the new tasks. Our results show that this architecture leads to better sample efficiency in the transfer, reducing sample complexity of target task learning to target apprentice learning.

READ FULL TEXT
research
06/11/2018

Context-Aware Policy Reuse

Transfer learning can greatly speed up reinforcement learning for a new ...
research
05/29/2022

Provable Benefits of Representational Transfer in Reinforcement Learning

We study the problem of representational transfer in RL, where an agent ...
research
05/10/2021

Adaptive Policy Transfer in Reinforcement Learning

Efficient and robust policy transfer remains a key challenge for reinfor...
research
07/06/2021

AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning

Most approaches in reinforcement learning (RL) are data-hungry and speci...
research
11/26/2022

Transfer RL via the Undo Maps Formalism

Transferring knowledge across domains is one of the most fundamental pro...
research
09/24/2017

An Optimal Online Method of Selecting Source Policies for Reinforcement Learning

Transfer learning significantly accelerates the reinforcement learning p...
research
10/10/2015

Attend, Adapt and Transfer: Attentive Deep Architecture for Adaptive Transfer from multiple sources in the same domain

Transferring knowledge from prior source tasks in solving a new target t...

Please sign up or login with your details

Forgot password? Click here to reset