Robust and Efficient Planning using Adaptive Entropy Tree Search

02/12/2021
by   Piotr Kozakowski, et al.
0

In this paper, we present the Adaptive EntropyTree Search (ANTS) algorithm. ANTS builds on recent successes of maximum entropy planning while mitigating its arguably major drawback - sensitivity to the temperature setting. We endow ANTS with a mechanism, which adapts the temperature to match a given range of action selection entropy in the nodes of the planning tree. With this mechanism, the ANTS planner enjoys remarkable hyper-parameter robustness, achieves high scores on the Atari benchmark, and is a capable component of a planning-learning loop akin to AlphaZero. We believe that all these features make ANTS a compelling choice for a general planner for complex tasks.

READ FULL TEXT
research
07/04/2013

Towards Combining HTN Planning and Geometric Task Planning

In this paper we present an interface between a symbolic planner and a g...
research
06/08/2023

Habits of Mind: Reusing Action Sequences for Efficient Planning

When we exercise sequences of actions, their execution becomes more flue...
research
11/27/2019

Learning Neural Search Policies for Classical Planning

Heuristic forward search is currently the dominant paradigm in classical...
research
10/16/1998

SYNERGY: A Linear Planner Based on Genetic Programming

In this paper we describe SYNERGY, which is a highly parallelizable, lin...
research
10/23/2018

Learning Classical Planning Strategies with Policy Gradient

A common paradigm in classical planning is heuristic forward search. For...
research
11/13/2021

Agile Satellite Planning for Multi-Payload Observations for Earth Science

We present planning challenges, methods and preliminary results for a ne...
research
06/01/2022

Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search

Complex reasoning problems contain states that vary in the computational...

Please sign up or login with your details

Forgot password? Click here to reset