Neural Logic Reinforcement Learning

04/24/2019
by   Zhengyao Jiang, et al.
0

Deep reinforcement learning (DRL) has achieved significant breakthroughs in various tasks. However, most DRL algorithms suffer a problem of generalizing the learned policy which makes the learning performance largely affected even by minor modifications of the training environment. Except that, the use of deep neural networks makes the learned policies hard to be interpretable. To address these two challenges, we propose a novel algorithm named Neural Logic Reinforcement Learning (NLRL) to represent the policies in reinforcement learning by first-order logic. NLRL is based on policy gradient methods and differentiable inductive logic programming that have demonstrated significant advantages in terms of interpretability and generalisability in supervised tasks. Extensive experiments conducted on cliff-walking and blocks manipulation tasks demonstrate that NLRL can induce interpretable policies achieving near-optimal performance while demonstrating good generalisability to environments of different initial states and problem sizes.

READ FULL TEXT
research
04/17/2023

Deep Explainable Relational Reinforcement Learning: A Neuro-Symbolic Approach

Despite numerous successes in Deep Reinforcement Learning (DRL), the lea...
research
04/06/2018

Programmatically Interpretable Reinforcement Learning

We study the problem of generating interpretable and verifiable policies...
research
09/12/2021

Direct Random Search for Fine Tuning of Deep Reinforcement Learning Policies

Researchers have demonstrated that Deep Reinforcement Learning (DRL) is ...
research
03/16/2023

Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics

We introduce a Reinforcement Learning Psychotherapy AI Companion that ge...
research
06/20/2023

Neural Inventory Control in Networks via Hindsight Differentiable Policy Optimization

Inventory management offers unique opportunities for reliably evaluating...
research
05/26/2022

Verifying Learning-Based Robotic Navigation Systems

Deep reinforcement learning (DRL) has become a dominant deep-learning pa...
research
08/31/2021

Learning to Synthesize Programs as Interpretable and Generalizable Policies

Recently, deep reinforcement learning (DRL) methods have achieved impres...

Please sign up or login with your details

Forgot password? Click here to reset