Solving Hard AI Planning Instances Using Curriculum-Driven Deep Reinforcement Learning

06/04/2020
by   Dieqiao Feng, et al.
5

Despite significant progress in general AI planning, certain domains remain out of reach of current AI planning systems. Sokoban is a PSPACE-complete planning task and represents one of the hardest domains for current AI planners. Even domain-specific specialized search methods fail quickly due to the exponential search complexity on hard instances. Our approach based on deep reinforcement learning augmented with a curriculum-driven method is the first one to solve hard instances within one day of training while other modern solvers cannot solve these instances within any reasonable time limit. In contrast to prior efforts, which use carefully handcrafted pruning techniques, our approach automatically uncovers domain structure. Our results reveal that deep RL provides a promising framework for solving previously unsolved AI planning problems, provided a proper training curriculum can be devised.

READ FULL TEXT

page 3

page 4

research
10/03/2021

A Novel Automated Curriculum Strategy to Solve Hard Sokoban Planning Instances

In recent years, we have witnessed tremendous progress in deep reinforce...
research
06/28/2022

Left Heavy Tails and the Effectiveness of the Policy and Value Networks in DNN-based best-first search for Sokoban Planning

Despite the success of practical solvers in various NP-complete domains ...
research
05/05/2020

Generalized Planning With Deep Reinforcement Learning

A hallmark of intelligence is the ability to deduce general principles f...
research
09/20/2022

Graph Value Iteration

In recent years, deep Reinforcement Learning (RL) has been successful in...
research
01/24/2023

NeSIG: A Neuro-Symbolic Method for Learning to Generate Planning Problems

In the field of Automated Planning there is often the need for a set of ...
research
09/09/2011

Macro-FF: Improving AI Planning with Automatically Learned Macro-Operators

Despite recent progress in AI planning, many benchmarks remain challengi...
research
08/04/2023

Solving Witness-type Triangle Puzzles Faster with an Automatically Learned Human-Explainable Predicate

Automatically solving puzzle instances in the game The Witness can guide...

Please sign up or login with your details

Forgot password? Click here to reset