More Efficient Exploration with Symbolic Priors on Action Sequence Equivalences

10/20/2021
by   Toby Johnstone, et al.
0

Incorporating prior knowledge in reinforcement learning algorithms is mainly an open question. Even when insights about the environment dynamics are available, reinforcement learning is traditionally used in a tabula rasa setting and must explore and learn everything from scratch. In this paper, we consider the problem of exploiting priors about action sequence equivalence: that is, when different sequences of actions produce the same effect. We propose a new local exploration strategy calibrated to minimize collisions and maximize new state visitations. We show that this strategy can be computed at little cost, by solving a convex optimization problem. By replacing the usual epsilon-greedy strategy in a DQN, we demonstrate its potential in several environments with various dynamic structures.

READ FULL TEXT

page 8

page 17

page 18

research
02/03/2019

A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning

In this paper we consider the problem of how a reinforcement learning ag...
research
05/26/2023

Reinforcement Learning with Simple Sequence Priors

Everything else being equal, simpler models should be preferred over mor...
research
02/04/2014

Safe Exploration of State and Action Spaces in Reinforcement Learning

In this paper, we consider the important problem of safe exploration in ...
research
09/09/2022

A Memory-Related Multi-Task Method Based on Task-Agnostic Exploration

We pose a new question: Can agents learn how to combine actions from pre...
research
12/21/2019

Can Agents Learn by Analogy? An Inferable Model for PAC Reinforcement Learning

Model-based reinforcement learning algorithms make decisions by building...
research
06/15/2019

Injecting Prior Knowledge for Transfer Learning into Reinforcement Learning Algorithms using Logic Tensor Networks

Human ability at solving complex tasks is helped by priors on object and...
research
10/05/2020

AdaLead: A simple and robust adaptive greedy search algorithm for sequence design

Efficient design of biological sequences will have a great impact across...

Please sign up or login with your details

Forgot password? Click here to reset