Pgx: Hardware-accelerated parallel game simulation for reinforcement learning

03/29/2023
by   Sotetsu Koyamada, et al.
0

We propose Pgx, a collection of board game simulators written in JAX. Thanks to auto-vectorization and Just-In-Time compilation of JAX, Pgx scales easily to thousands of parallel execution on GPU/TPU accelerators. We found that the simulation of Pgx on a single A100 GPU is 10x faster than that of existing reinforcement learning libraries. Pgx implements games considered vital benchmarks in artificial intelligence research, such as Backgammon, Shogi, and Go. Pgx is available at https://github.com/sotetsuk/pgx.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/24/2023

Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation

Reinforcement learning is time-consuming for complex tasks due to the ne...
research
07/19/2019

GPU-Accelerated Atari Emulation for Reinforcement Learning

We designed and implemented a CUDA port of the Atari Learning Environmen...
research
08/15/2018

Deep RTS: A Game Environment for Deep Reinforcement Learning in Real-Time Strategy Games

Reinforcement learning (RL) is an area of research that has blossomed tr...
research
10/14/2018

Assessing the Potential of Classical Q-learning in General Game Playing

After the recent groundbreaking results of AlphaGo and AlphaZero, we hav...
research
03/23/2020

Neural Game Engine: Accurate learning of generalizable forward models from pixels

Access to a fast and easily copied forward model of a game is essential ...
research
03/23/2020

Neural Game Engine: Accurate learning ofgeneralizable forward models from pixels

Access to a fast and easily copied forward model of a game is essential ...
research
09/06/2022

TAPA: A Scalable Task-Parallel Dataflow Programming Framework for Modern FPGAs with Co-Optimization of HLS and Physical Design

In this paper, we propose TAPA, an end-to-end framework that compiles a ...

Please sign up or login with your details

Forgot password? Click here to reset