Does Knowledge Transfer Always Help to Learn a Better Policy?

12/06/2019
by Fei Feng, et al.

A key approach to saving samples when learning a policy for a reinforcement learning problem is to use knowledge from an approximate model, such as a simulator. However, does knowledge transfer from approximate models always help to learn a better policy? Despite numerous empirical studies of transfer reinforcement learning, an answer to this question remains elusive. In this paper, we provide a strong negative result, showing that even full knowledge of an approximate model may not reduce the number of samples needed to learn an accurate policy for the true model. We construct an example of reinforcement learning models and show that the sample complexity with and without knowledge transfer is of the same order. On the bright side, effective knowledge transfer is still possible under additional assumptions. In particular, we demonstrate that knowing the (linear) bases of the true model significantly reduces the number of samples needed to learn an accurate policy.

research
05/10/2021

Adaptive Policy Transfer in Reinforcement Learning

Efficient and robust policy transfer remains a key challenge for reinfor...
research
02/11/2021

Sufficiently Accurate Model Learning for Planning

Data driven models of dynamical systems help planners and controllers to...
research
12/07/2010

Bridging the Gap between Reinforcement Learning and Knowledge Representation: A Logical Off- and On-Policy Framework

Knowledge Representation is important issue in reinforcement learning. I...
research
04/16/2022

Efficient Bayesian Policy Reuse with a Scalable Observation Model in Deep Reinforcement Learning

Bayesian policy reuse (BPR) is a general policy transfer framework for s...
research
10/22/2017

Searching for effective and efficient way of knowledge transfer within an organization

In this paper three models of knowledge transfer in organization are con...
research
07/04/2018

Transfer with Model Features in Reinforcement Learning

A key question in Reinforcement Learning is which representation an agen...
research
09/06/2018

ANS: Adaptive Network Scaling for Deep Rectifier Reinforcement Learning Models

This work provides a thorough study on how reward scaling can affect per...