Inverse Constrained Reinforcement Learning

11/19/2020
by   Usman Anwar, et al.
10

Standard reinforcement learning (RL) algorithms train agents to maximize given reward functions. However, many real-world applications of RL require agents to also satisfy certain constraints which may, for example, be motivated by safety concerns. Constrained RL algorithms approach this problem by training agents to maximize given reward functions while respecting explicitly defined constraints. However, in many cases, manually designing accurate constraints is a challenging task. In this work, given a reward function and a set of demonstrations from an expert that maximizes this reward function while respecting unknown constraints, we propose a framework to learn the most likely constraints that the expert respects. We then train agents to maximize the given reward function subject to the learned constraints. Previous works in this regard have either mainly been restricted to tabular settings or specific types of constraints or assume knowledge of transition dynamics of the environment. In contrast, we empirically show that our framework is able to learn arbitrary Markovian constraints in high-dimensions in a model-free setting.

READ FULL TEXT

page 2

page 6

page 8

research
06/02/2022

Learning Soft Constraints From Constrained Expert Demonstrations

Inverse reinforcement learning (IRL) methods assume that the expert data...
research
02/22/2020

Guided Constrained Policy Optimization for Dynamic Quadrupedal Robot Locomotion

Deep reinforcement learning (RL) uses model-free techniques to optimize ...
research
06/20/2022

Benchmarking Constraint Inference in Inverse Reinforcement Learning

When deploying Reinforcement Learning (RL) agents into a physical system...
research
05/15/2023

What Matters in Reinforcement Learning for Tractography

Recently, deep reinforcement learning (RL) has been proposed to learn th...
research
03/23/2021

Assured Learning-enabled Autonomy: A Metacognitive Reinforcement Learning Framework

Reinforcement learning (RL) agents with pre-specified reward functions c...
research
06/01/2023

Identifiability and Generalizability in Constrained Inverse Reinforcement Learning

Two main challenges in Reinforcement Learning (RL) are designing appropr...
research
04/14/2023

Learning to Learn Group Alignment: A Self-Tuning Credo Framework with Multiagent Teams

Mixed incentives among a population with multiagent teams has been shown...

Please sign up or login with your details

Forgot password? Click here to reset