An AlphaZero-Inspired Approach to Solving Search Problems

07/02/2022
by   Evgeny Dantsin, et al.
0

AlphaZero and its extension MuZero are computer programs that use machine-learning techniques to play at a superhuman level in chess, go, and a few other games. They achieved this level of play solely with reinforcement learning from self-play, without any domain knowledge except the game rules. It is a natural idea to adapt the methods and techniques used in AlphaZero for solving search problems such as the Boolean satisfiability problem (in its search version). Given a search problem, how to represent it for an AlphaZero-inspired solver? What are the "rules of solving" for this search problem? We describe possible representations in terms of easy-instance solvers and self-reductions, and we give examples of such representations for the satisfiability problem. We also describe a version of Monte Carlo tree search adapted for search problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/05/2017

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

The game of chess is the most widely-studied domain in the history of ar...
research
01/17/2021

Solving QSAT problems with neural MCTS

Recent achievements from AlphaZero using self-play has shown remarkable ...
research
02/27/2019

Accelerating Self-Play Learning in Go

By introducing several new Go-specific and non-Go-specific techniques al...
research
05/22/2023

Know your Enemy: Investigating Monte-Carlo Tree Search with Opponent Models in Pommerman

In combination with Reinforcement Learning, Monte-Carlo Tree Search has ...
research
11/22/2022

Monte Carlo Forest Search: UNSAT Solver Synthesis via Reinforcement learning

We introduce Monte Carlo Forest Search (MCFS), an offline algorithm for ...
research
06/03/2011

GIB: Imperfect Information in a Computationally Challenging Game

This paper investigates the problems arising in the construction of a pr...
research
04/17/2023

Search-Space Pruning with Int-Splits for Faster QBF Solving

In many QBF encodings, sequences of Boolean variables stand for binary r...

Please sign up or login with your details

Forgot password? Click here to reset