Soft Actor-Critic With Integer Actions

09/17/2021
by   Ting-Han Fan, et al.
0

Reinforcement learning is well-studied under discrete actions. Integer actions setting is popular in the industry yet still challenging due to its high dimensionality. To this end, we study reinforcement learning under integer actions by incorporating the Soft Actor-Critic (SAC) algorithm with an integer reparameterization. Our key observation for integer actions is that their discrete structure can be simplified using their comparability property. Hence, the proposed integer reparameterization does not need one-hot encoding and is of low dimensionality. Experiments show that the proposed SAC under integer actions is as good as the continuous action version on robot control tasks and outperforms Proximal Policy Optimization on power distribution systems control tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/16/2019

Soft Actor-Critic for Discrete Action Settings

Soft Actor-Critic is a state-of-the-art reinforcement learning algorithm...
research
04/03/2020

Reinforcement Learning for Mixed-Integer Problems Based on MPC

Model Predictive Control has been recently proposed as policy approximat...
research
05/02/2023

Mixed-Integer Optimal Control via Reinforcement Learning: A Case Study on Hybrid Vehicle Energy Management

Many optimal control problems require the simultaneous output of continu...
research
12/23/2019

Discrete and Continuous Action Representation for Practical RL in Video Games

While most current research in Reinforcement Learning (RL) focuses on im...
research
07/02/2019

Modified Actor-Critics

Robot Learning, from a control point of view, often involves continuous ...
research
04/13/2021

Bi-level Off-policy Reinforcement Learning for Volt/VAR Control Involving Continuous and Discrete Devices

In Volt/Var control (VVC) of active distribution networks(ADNs), both sl...
research
11/07/2010

Reinforcement Learning Based on Active Learning Method

In this paper, a new reinforcement learning approach is proposed which i...

Please sign up or login with your details

Forgot password? Click here to reset