Accelerating Monte-Carlo Tree Search on CPU-FPGA Heterogeneous Platform

08/23/2022
by   Yuan Meng, et al.
0

Monte Carlo Tree Search (MCTS) methods have achieved great success in many Artificial Intelligence (AI) benchmarks. The in-tree operations become a critical performance bottleneck in realizing parallel MCTS on CPUs. In this work, we develop a scalable CPU-FPGA system for Tree-Parallel MCTS. We propose a novel decomposition and mapping of MCTS data structure and computation onto CPU and FPGA to reduce communication and coordination. High scalability of our system is achieved by encapsulating in-tree operations in an SRAM-based FPGA accelerator. To lower the high data access latency and inter-worker synchronization overheads, we develop several hardware optimizations. We show that by using our accelerator, we obtain up to 35× speedup for in-tree operations, and 3× higher overall system throughput. Our CPU-FPGA system also achieves superior scalability wrt number of parallel workers than state-of-the-art parallel MCTS implementations on CPU.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/01/2019

A Novel FPGA-Based High Throughput Accelerator For Binary Search Trees

This paper presents a deeply pipelined and massively parallel Binary Sea...
research
04/02/2017

Structured Parallel Programming for Monte Carlo Tree Search

In this paper, we present a new algorithm for parallel Monte Carlo tree ...
research
08/29/2019

High Performance Scalable FPGA Accelerator for Deep Neural Networks

Low-precision is the first order knob for achieving higher Artificial In...
research
06/19/2023

Co-design Hardware and Algorithm for Vector Search

Vector search has emerged as the foundation for large-scale information ...
research
08/13/2023

When Monte-Carlo Dropout Meets Multi-Exit: Optimizing Bayesian Neural Networks on FPGA

Bayesian Neural Networks (BayesNNs) have demonstrated their capability o...
research
04/06/2023

A computation of D(9) using FPGA Supercomputing

This preprint makes the claim of having computed the 9^th Dedekind Numbe...
research
06/23/2021

Weighted Random Sampling on GPUs

An alias table is a data structure that allows for efficiently drawing w...

Please sign up or login with your details

Forgot password? Click here to reset