GLIB: Exploration via Goal-Literal Babbling for Lifted Operator Learning

01/22/2020
by Rohan Chitnis, et al.

We address the problem of efficient exploration for learning lifted operators in sequential decision-making problems without extrinsic goals or rewards. Inspired by human curiosity, we propose goal-literal babbling (GLIB), a simple and general method for exploration in such problems. GLIB samples goals that are conjunctions of literals, which can be understood as specific, targeted effects that the agent would like to achieve in the world, and plans to achieve these goals using the operators being learned. We conduct a case study to elucidate two key benefits of GLIB: robustness to overly general preconditions and efficient exploration in domains with effects at long horizons. We also provide theoretical guarantees and further empirical results, finding GLIB to be effective on a range of benchmark planning tasks.
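The goal-sampling step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the literal representation, the function names (`ground_literals`, `sample_novel_goal`), the `max_size` bound, and the rule of rejecting goals that were already proposed are all assumptions made for illustration. The planning step, in which the agent uses the operators learned so far to reach the sampled goal, is omitted.

```python
import itertools
import random
from typing import Dict, FrozenSet, List, Optional, Set, Tuple

# A ground literal is (predicate name, tuple of object names); hypothetical encoding.
Literal = Tuple[str, Tuple[str, ...]]
# A goal is a conjunction of literals, stored as an unordered set.
Goal = FrozenSet[Literal]


def ground_literals(predicates: Dict[str, int], objects: List[str]) -> List[Literal]:
    """Enumerate every ground literal with distinct arguments."""
    literals = []
    for name, arity in sorted(predicates.items()):
        for args in itertools.permutations(objects, arity):
            literals.append((name, args))
    return literals


def sample_novel_goal(
    literals: List[Literal],
    seen: Set[Goal],
    max_size: int,
    rng: random.Random,
    max_tries: int = 100,
) -> Optional[Goal]:
    """Sample a conjunction of 1..max_size literals not proposed before.

    The novelty filter is an assumption of this sketch. Returns None if no
    unseen goal is found within max_tries attempts.
    """
    for _ in range(max_tries):
        size = rng.randint(1, max_size)
        goal = frozenset(rng.sample(literals, size))
        if goal not in seen:
            seen.add(goal)
            return goal
    return None


if __name__ == "__main__":
    rng = random.Random(0)
    lits = ground_literals({"on": 2, "clear": 1}, ["a", "b"])
    seen: Set[Goal] = set()
    for _ in range(3):
        print(sorted(sample_novel_goal(lits, seen, max_size=2, rng=rng)))
```

In a full exploration loop, each sampled goal would be handed to a planner that uses the current learned operators. What to do when no plan is found (for example, falling back to a random action) is a design choice this sketch leaves open.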

research
08/16/2022

Learning Operators with Ignore Effects for Bilevel Planning in Continuous Domains

Bilevel planning, in which a high-level search over an abstraction of an...
research
03/23/2023

Planning Goals for Exploration

Dropped into an unknown environment, what should an agent do to quickly ...
research
05/30/2022

Adaptive Learning for Discovery

In this paper, we study a sequential decision-making problem, called Ada...
research
12/24/2020

SPOTTER: Extending Symbolic Planning Operators through Targeted Reinforcement Learning

Symbolic planning models allow decision-making agents to sequence action...
research
11/09/2021

Optimizing robot planning domains to reduce search time for long-horizon planning

We have recently introduced a system that automatically generates roboti...
research
06/10/2019

Autonomous Goal Exploration using Learned Goal Spaces for Visuomotor Skill Acquisition in Robots

The automatic and efficient discovery of skills, without supervision, fo...
research
05/16/2023

Sasha: creative goal-oriented reasoning in smart homes with large language models

Every smart home user interaction has an explicit or implicit goal. Exis...
