Systematic Generalisation through Task Temporal Logic and Deep Reinforcement Learning

06/12/2020
by   Borja G. León, et al.
31

This paper presents a neuro-symbolic agent that combines deep reinforcement learning (DRL) with temporal logic (TL), and achieves systematic out-of-distribution generalisation in tasks that involve following a formally specified instruction. Specifically, the agent learns general notions of negation and disjunction, and successfully applies them to previously unseen objects without further training. To this end, we also introduce Task Temporal Logic (TTL), a learning-oriented formal language, whose atoms are designed to help the training of a DRL agent targeting systematic generalisation. To validate this combination of logic-based and neural-network techniques, we provide experimental evidence for the kind of neural-network architecture that most enhances the generalisation performance of the agent. Our findings suggest that the right architecture can significatively improve the ability of the agent to generalise in systematic ways, even with abstract operators, such as negation, which previous research have struggled with.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/11/2023

Multi-Agent Reinforcement Learning Guided by Signal Temporal Logic Specifications

There has been growing interest in deep reinforcement learning (DRL) alg...
research
10/18/2021

In a Nutshell, the Human Asked for This: Latent Goals for Following Temporal Specifications

We address the problem of building agents whose goal is to satisfy out-o...
research
03/11/2019

Stroke-based Artistic Rendering Agent with Deep Reinforcement Learning

Excellent painters can use only a few strokes to create a fantastic pain...
research
05/30/2022

RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch

Training deep reinforcement learning (DRL) models usually requires high ...
research
12/24/2020

Auto-Agent-Distiller: Towards Efficient Deep Reinforcement Learning Agents via Neural Architecture Search

AlphaGo's astonishing performance has ignited an explosive interest in d...
research
03/24/2023

Robust Path Following on Rivers Using Bootstrapped Reinforcement Learning

This paper develops a Deep Reinforcement Learning (DRL)-agent for naviga...
research
10/03/2022

Learning Minimally-Violating Continuous Control for Infeasible Linear Temporal Logic Specifications

This paper explores continuous-time control synthesis for target-driven ...

Please sign up or login with your details

Forgot password? Click here to reset