A Policy Search Method For Temporal Logic Specified Reinforcement Learning Tasks

09/27/2017
by   Xiao Li, et al.

Reward engineering is an important aspect of reinforcement learning. Whether the user's intentions are correctly encapsulated in the reward function can significantly impact the learning outcome. Current methods rely on manually crafted reward functions that often require parameter tuning to obtain the desired behavior. This tuning can be expensive when exploration requires systems to interact with the physical world. In this paper, we explore the use of temporal logic (TL) to specify tasks in reinforcement learning. A TL formula can be translated to a real-valued function that measures its level of satisfaction against a trajectory. We take advantage of this function and propose temporal logic policy search (TLPS), a model-free learning technique that finds a policy that satisfies the TL specification. A set of simulated experiments is conducted to evaluate the proposed approach.
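
To make the idea of a real-valued satisfaction measure concrete, the following is a minimal sketch, not the paper's definition. It assumes the quantitative semantics commonly used for temporal logics over trajectories: "eventually" takes the maximum of a predicate's margin over the trajectory, "always" takes the minimum, and conjunction takes the minimum of its operands. The task, the predicates `reach_goal` and `stay_in_bounds`, and all thresholds are hypothetical and chosen only for illustration.

```python
from typing import Callable, Sequence

State = Sequence[float]
# A predicate returns a signed margin: positive when satisfied, negative when violated.
Predicate = Callable[[State], float]


def eventually(pred: Predicate, traj: Sequence[State]) -> float:
    """Margin of 'pred holds at some step': best margin along the trajectory."""
    return max(pred(s) for s in traj)


def always(pred: Predicate, traj: Sequence[State]) -> float:
    """Margin of 'pred holds at every step': worst margin along the trajectory."""
    return min(pred(s) for s in traj)


def conjunction(*values: float) -> float:
    """Margin of 'all sub-formulas hold': the weakest sub-formula decides."""
    return min(values)


# Hypothetical 1-D task: eventually reach x >= 1.0 while always keeping x <= 2.0.
def reach_goal(s: State) -> float:
    return s[0] - 1.0


def stay_in_bounds(s: State) -> float:
    return 2.0 - s[0]


def task_robustness(traj: Sequence[State]) -> float:
    """Real-valued score; positive means the trajectory satisfies the task."""
    return conjunction(
        eventually(reach_goal, traj),
        always(stay_in_bounds, traj),
    )


if __name__ == "__main__":
    good = [[0.0], [0.5], [1.5]]  # reaches the goal and stays in bounds
    bad = [[0.0], [0.5]]          # never reaches the goal
    print(task_robustness(good), task_robustness(bad))
```

Under these assumptions, a score of this kind could serve as the trajectory-level return in a policy search loop, since its sign indicates satisfaction and its magnitude indicates the margin.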


