Transformers as Policies for Variable Action Environments

01/09/2023
by   Niklas Zwingenberger, et al.
0

In this project we demonstrate the effectiveness of the transformer encoder as a viable architecture for policies in variable action environments. Using it, we train an agent using Proximal Policy Optimisation (PPO) on multiple maps against scripted opponents in the Gym-μRTS environment. The final agent is able to achieve a higher return using half the computational resources of the next-best RL agent, which used the GridNet architecture. The source code and pre-trained models are available here: https://github.com/NiklasZ/transformers-for-variable-action-envs

READ FULL TEXT

page 10

page 11

research
04/08/2020

Adaptive Transformers in RL

Recent developments in Transformers have opened new interesting areas of...
research
04/01/2022

Transformers for 1D Signals in Parkinson's Disease Detection from Gait

This paper focuses on the detection of Parkinson's disease based on the ...
research
08/28/2020

Next-Best View Policy for 3D Reconstruction

Manually selecting viewpoints or using commonly available flight planner...
research
04/25/2023

Centralized control for multi-agent RL in a complex Real-Time-Strategy game

Multi-agent Reinforcement learning (MARL) studies the behaviour of multi...
research
10/07/2021

Offline RL With Resource Constrained Online Deployment

Offline reinforcement learning is used to train policies in scenarios wh...
research
09/17/2020

ISCAS at SemEval-2020 Task 5: Pre-trained Transformers for Counterfactual Statement Modeling

ISCAS participated in two subtasks of SemEval 2020 Task 5: detecting cou...
research
04/27/2022

Learning to Parallelize in a Shared-Memory Environment with Transformers

In past years, the world has switched to many-core and multi-core shared...

Please sign up or login with your details

Forgot password? Click here to reset