An Autonomous Non-monolithic Agent with Multi-mode Exploration based on Options Framework

05/02/2023
by Jaeyoon Kim, et al.

Most exploration research in reinforcement learning (RL) has focused on 'the way of exploration', that is, 'how to explore'. The complementary question, 'when to explore', has not been a main focus of RL exploration research. In the usual monolithic exploration behaviour, the question of 'when' is left implicit, because each exploratory action is bound to an exploitational action of the agent. Recently, non-monolithic exploration research has emerged to examine the mode-switching exploration behaviour of humans and animals. The ultimate purpose of our research is to enable an agent to decide autonomously when to explore and when to exploit. We describe initial research on autonomous multi-mode exploration with non-monolithic behaviour in an options framework. Comparative experiments show that our method achieves higher performance than the existing non-monolithic exploration method.
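To make the distinction between monolithic and non-monolithic exploration concrete, the following is a minimal sketch and not the authors' method. It contrasts epsilon-greedy, where the explore/exploit choice is redrawn at every step, with a hypothetical two-level agent whose higher level picks a mode that then persists for several steps. The names `ModeSwitchingAgent`, `explore_duration`, and `switch_prob` are illustrative assumptions, and the random trigger with a fixed duration merely stands in for whatever learned switching rule an actual options-based agent would use.

```python
import random

EXPLORE, EXPLOIT = "explore", "exploit"


def greedy(q_values):
    """Return the index of the highest-valued action."""
    return max(range(len(q_values)), key=lambda a: q_values[a])


def epsilon_greedy(q_values, epsilon, rng):
    """Monolithic exploration: explore/exploit is redrawn at every single step."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return greedy(q_values)


class ModeSwitchingAgent:
    """Hypothetical non-monolithic agent (an assumption, not the paper's design).

    A higher-level rule chooses a mode; an exploration mode, once entered,
    persists for `explore_duration` steps before the choice is revisited.
    """

    def __init__(self, explore_duration, switch_prob, rng):
        self.explore_duration = explore_duration
        self.switch_prob = switch_prob  # stand-in for a learned switching rule
        self.rng = rng
        self.mode = EXPLOIT
        self.steps_left = 0

    def select_mode(self):
        # While an exploration period is running, keep the current mode.
        if self.steps_left > 0:
            self.steps_left -= 1
            return self.mode
        # Otherwise the higher level decides which mode to run next.
        if self.rng.random() < self.switch_prob:
            self.mode = EXPLORE
            self.steps_left = self.explore_duration - 1
        else:
            self.mode = EXPLOIT
        return self.mode

    def act(self, q_values):
        mode = self.select_mode()
        if mode == EXPLORE:
            return mode, self.rng.randrange(len(q_values))
        return mode, greedy(q_values)
```

In this sketch the low-level behaviours (uniform random versus greedy) play the role of two options, and `select_mode` plays the role of the policy over options. Replacing the random trigger with a learned higher-level policy is the part that the abstract describes as deciding 'when to explore' autonomously.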

research
08/26/2021

When should agents explore?

Exploration remains a central challenge for reinforcement learning (RL)....
research
03/29/2022

When to Go, and When to Explore: The Benefit of Post-Exploration in Intrinsic Motivation

Go-Explore achieved breakthrough performance on challenging reinforcemen...
research
12/06/2022

First Go, then Post-Explore: the Benefits of Post-Exploration in Intrinsic Motivation

Go-Explore achieved breakthrough performance on challenging reinforcemen...
research
06/14/2020

Non-local Policy Optimization via Diversity-regularized Collaborative Exploration

Conventional Reinforcement Learning (RL) algorithms usually have one sin...
research
07/03/2017

Hashing Over Predicted Future Frames for Informed Exploration of Deep Reinforcement Learning

In reinforcement learning (RL) tasks, an efficient exploration mechanism...
research
06/13/2022

Intrinsically motivated option learning: a comparative study of recent methods

Options represent a framework for reasoning across multiple time scales ...
research
01/26/2023

Deep Laplacian-based Options for Temporally-Extended Exploration

Selecting exploratory actions that generate a rich stream of experience ...
