ACERAC: Efficient reinforcement learning in fine time discretization

04/08/2021
by   Paweł Wawrzyński, et al.
0

We propose a framework for reinforcement learning (RL) in fine time discretization and a learning algorithm in this framework. One of the main goals of RL is to provide a way for physical machines to learn optimal behavior instead of being programmed. However, the machines are usually controlled in fine time discretization. The most common RL methods apply independent random elements to each action, which is not suitable in that setting. It is not feasible because it causes the controlled system to jerk, and does not ensure sufficient exploration since a single action is not long enough to create a significant experience that could be translated into policy improvement. In the RL framework introduced in this paper, policies are considered that produce actions based on states and random elements autocorrelated in subsequent time instants. The RL algorithm introduced here approximately optimizes such a policy. The efficiency of this algorithm is verified against three other RL methods (PPO, SAC, ACER) in four simulated learning control problems (Ant, HalfCheetah, Hopper, and Walker2D) in diverse time discretization. The algorithm introduced here outperforms the competitors in most cases considered.

READ FULL TEXT
research
09/10/2020

A framework for reinforcement learning with autocorrelated actions

The subject of this paper is reinforcement learning. Policies are consid...
research
08/08/2023

Actor-Critic with variable time discretization via sustained actions

Reinforcement learning (RL) methods work in discrete time. In order to a...
research
02/23/2023

Diverse Policy Optimization for Structured Action Space

Enhancing the diversity of policies is beneficial for robustness, explor...
research
01/09/2021

Identifying Decision Points for Safe and Interpretable Reinforcement Learning in Hypotension Treatment

Many batch RL health applications first discretize time into fixed inter...
research
05/09/2020

Reinforcement Learning for Thermostatically Controlled Loads Control using Modelica and Python

The aim of the project is to investigate and assess opportunities for ap...
research
07/07/2022

Robust optimal well control using an adaptive multi-grid reinforcement learning framework

Reinforcement learning (RL) is a promising tool to solve robust optimal ...
research
10/15/2022

A multilevel reinforcement learning framework for PDE based control

Reinforcement learning (RL) is a promising method to solve control probl...

Please sign up or login with your details

Forgot password? Click here to reset