Learning Action-Transferable Policy with Action Embedding

09/05/2019
by   Yu Chen, et al.
0

Despite achieving great success on performance in various sequential decision task, deep reinforcement learning is extremely data inefficient. Many approaches have been proposed to improve the data efficiency, e.g. transfer learning which utilizes knowledge learned from related tasks to accelerate training. Previous researches on transfer learning mostly attempt to learn a common feature space of states across related tasks to exploit knowledge as much as possible. However, semantic information of actions may be shared as well, even between tasks with different action space size. In this work, we first propose a method to learn action embedding for discrete actions in RL from generated trajectories without any prior knowledge, and then leverage it to transfer policy across tasks with different state space and/or discrete action space. We validate our method on a set of gridworld navigation tasks, discretized continuous control tasks and fighting tasks in a commercial video game. Our experimental results show that our method can effectively learn informative action embeddings and accelerate learning by policy transfer across tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/19/2020

Efficient Deep Reinforcement Learning through Policy Transfer

Transfer Learning (TL) has shown great potential to accelerate Reinforce...
research
01/06/2021

Learn Dynamic-Aware State Embedding for Transfer Learning

Transfer reinforcement learning aims to improve the sample efficiency of...
research
05/25/2021

Transfer Learning and Curriculum Learning in Sokoban

Transfer learning can speed up training in machine learning and is regul...
research
02/19/2019

DOM-Q-NET: Grounded RL on Structured Language

Building agents to interact with the web would allow for significant imp...
research
01/11/2018

Model-Based Action Exploration

Deep reinforcement learning has great stride in solving challenging moti...
research
06/12/2020

Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch

Deep reinforcement learning (RL) algorithms have achieved great success ...
research
07/13/2020

DinerDash Gym: A Benchmark for Policy Learning in High-Dimensional Action Space

It has been arduous to assess the progress of a policy learning algorith...

Please sign up or login with your details

Forgot password? Click here to reset