A0C: Alpha Zero in Continuous Action Space

05/24/2018
by   Thomas M. Moerland, et al.
0

A core novelty of Alpha Zero is the interleaving of tree search and deep learning, which has proven very successful in board games like Chess, Shogi and Go. These games have a discrete action space. However, many real-world reinforcement learning domains have continuous action spaces, for example in robotic control, navigation and self-driving cars. This paper presents the necessary theoretical extensions of Alpha Zero to deal with continuous action space. We also provide some preliminary experiments on the Pendulum swing-up task, empirically showing the feasibility of our approach. Thereby, this work provides a first step towards the application of iterated search and learning in domains with a continuous action space.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/10/2018

Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space

Most existing deep reinforcement learning (DRL) frameworks consider eith...
research
10/19/2020

Dream and Search to Control: Latent Space Planning for Continuous Control

Learning and planning with latent space dynamics has been shown to be us...
research
11/13/2015

Deep Reinforcement Learning in Parameterized Action Space

Recent work has shown that deep neural networks are capable of approxima...
research
02/05/2022

Transfer Reinforcement Learning for Differing Action Spaces via Q-Network Representations

Transfer learning approaches in reinforcement learning aim to assist age...
research
10/29/2020

Deep Jump Q-Evaluation for Offline Policy Evaluation in Continuous Action Space

We consider off-policy evaluation (OPE) in continuous action domains, su...
research
03/31/2020

Exploration in Action Space

Parameter space exploration methods with black-box optimization have rec...
research
05/23/2022

Learning Long-Horizon Robot Exploration Strategies for Multi-Object Search in Continuous Action Spaces

Recent advances in vision-based navigation and exploration have shown im...

Please sign up or login with your details

Forgot password? Click here to reset