Discrete and Continuous Action Representation for Practical RL in Video Games

12/23/2019
by   Olivier Delalleau, et al.
0

While most current research in Reinforcement Learning (RL) focuses on improving the performance of the algorithms in controlled environments, the use of RL under constraints like those met in the video game industry is rarely studied. Operating under such constraints, we propose Hybrid SAC, an extension of the Soft Actor-Critic algorithm able to handle discrete, continuous and parameterized actions in a principled way. We show that Hybrid SAC can successfully solve a highspeed driving task in one of our games, and is competitive with the state-of-the-art on parameterized actions benchmark tasks. We also explore the impact of using normalizing flows to enrich the expressiveness of the policy at minimal computational cost, and identify a potential undesired effect of SAC when used with normalizing flows, that may be addressed by optimizing a different objective.

READ FULL TEXT
research
09/17/2021

Soft Actor-Critic With Integer Actions

Reinforcement learning is well-studied under discrete actions. Integer a...
research
01/07/2021

Attention Actor-Critic algorithm for Multi-Agent Constrained Co-operative Reinforcement Learning

In this work, we consider the problem of computing optimal actions for R...
research
05/05/2018

Deep Reinforcement Learning for Playing 2.5D Fighting Games

Deep reinforcement learning has shown its success in game playing. Howev...
research
04/18/2023

Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints

This study presents a benchmark for evaluating action-constrained reinfo...
research
08/08/2023

Actor-Critic with variable time discretization via sustained actions

Reinforcement learning (RL) methods work in discrete time. In order to a...
research
10/07/2019

Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions

From a young age humans learn to use grammatical principles to hierarchi...

Please sign up or login with your details

Forgot password? Click here to reset