Data-Driven Inverse Reinforcement Learning for Expert-Learner Zero-Sum Games

01/05/2023
by   Wenqian Xue, et al.
0

In this paper, we formulate inverse reinforcement learning (IRL) as an expert-learner interaction whereby the optimal performance intent of an expert or target agent is unknown to a learner agent. The learner observes the states and controls of the expert and hence seeks to reconstruct the expert's cost function intent and thus mimics the expert's optimal response. Next, we add non-cooperative disturbances that seek to disrupt the learning and stability of the learner agent. This leads to the formulation of a new interaction we call zero-sum game IRL. We develop a framework to solve the zero-sum game IRL problem that is a modified extension of RL policy iteration (PI) to allow unknown expert performance intentions to be computed and non-cooperative disturbances to be rejected. The framework has two parts: a value function and control action update based on an extension of PI, and a cost function update based on standard inverse optimal control. Then, we eventually develop an off-policy IRL algorithm that does not require knowledge of the expert and learner agent dynamics and performs single-loop learning. Rigorous proofs and analyses are given. Finally, simulation experiments are presented to show the effectiveness of the new approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/18/2019

Inverse Cooperative and Non-Cooperative Dynamic Games Based on Maximum Entropy Inverse Reinforcement Learning

Dynamic game theory provides mathematical means for modeling the interac...
research
04/09/2021

Inverse Reinforcement Learning a Control Lyapunov Approach

Inferring the intent of an intelligent agent from demonstrations and sub...
research
07/02/2020

Robust Inverse Reinforcement Learning under Transition Dynamics Mismatch

We study the inverse reinforcement learning (IRL) problem under the tran...
research
10/27/2017

Inverse Reinforcement Learning Under Noisy Observations

We consider the problem of performing inverse reinforcement learning whe...
research
01/07/2018

Competitive Multi-agent Inverse Reinforcement Learning with Sub-optimal Demonstrations

This paper considers the problem of inverse reinforcement learning in ze...
research
05/17/2023

A proof of imitation of Wasserstein inverse reinforcement learning for multi-objective optimization

We prove Wasserstein inverse reinforcement learning enables the learner'...
research
06/07/2021

Learning without Knowing: Unobserved Context in Continuous Transfer Reinforcement Learning

In this paper, we consider a transfer Reinforcement Learning (RL) proble...

Please sign up or login with your details

Forgot password? Click here to reset