Reinforcement Learning of Action and Query Policies with LTL Instructions under Uncertain Event Detector

09/06/2023
by   Wataru Hatanaka, et al.
0

Reinforcement learning (RL) with linear temporal logic (LTL) objectives can allow robots to carry out symbolic event plans in unknown environments. Most existing methods assume that the event detector can accurately map environmental states to symbolic events; however, uncertainty is inevitable for real-world event detectors. Such uncertainty in an event detector generates multiple branching possibilities on LTL instructions, confusing action decisions. Moreover, the queries to the uncertain event detector, necessary for the task's progress, may increase the uncertainty further. To cope with those issues, we propose an RL framework, Learning Action and Query over Belief LTL (LAQBL), to learn an agent that can consider the diversity of LTL instructions due to uncertain event detection while avoiding task failure due to the unnecessary event-detection query. Our framework simultaneously learns 1) an embedding of belief LTL, which is multiple branching possibilities on LTL instructions using a graph neural network, 2) an action policy, and 3) a query policy which decides whether or not to query for the event detector. Simulations in a 2D grid world and image-input robotic inspection environments show that our method successfully learns actions to follow LTL instructions even with uncertain event detectors.

READ FULL TEXT

page 1

page 7

research
04/07/2022

A Framework for Following Temporal Logic Instructions with Unknown Causal Dependencies

Teaching a deep reinforcement learning (RL) agent to follow instructions...
research
02/13/2021

LTL2Action: Generalizing LTL Instructions for Multi-Task RL

We address the problem of teaching a deep reinforcement learning (RL) ag...
research
11/01/2022

Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions

The adoption of pre-trained language models to generate action plans for...
research
12/21/2018

Learning to Navigate the Web

Learning in environments with large state and action spaces, and sparse ...
research
10/16/2018

Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation

A general-purpose intelligent robot must be able to learn autonomously a...
research
12/14/2021

Quantifying Multimodality in World Models

Model-based Deep Reinforcement Learning (RL) assumes the availability of...
research
06/09/2021

TempoRL: Learning When to Act

Reinforcement learning is a powerful approach to learn behaviour through...

Please sign up or login with your details

Forgot password? Click here to reset