Robust Imitation via Mirror Descent Inverse Reinforcement Learning

10/20/2022
by   Dong-Sig Han, et al.
0

Recently, adversarial imitation learning has shown a scalable reward acquisition method for inverse reinforcement learning (IRL) problems. However, estimated reward signals often become uncertain and fail to train a reliable statistical model since the existing methods tend to solve hard optimization problems directly. Inspired by a first-order optimization method called mirror descent, this paper proposes to predict a sequence of reward functions, which are iterative solutions for a constrained convex problem. IRL solutions derived by mirror descent are tolerant to the uncertainty incurred by target density estimation since the amount of reward learning is regulated with respect to local geometric constraints. We prove that the proposed mirror descent update rule ensures robust minimization of a Bregman divergence in terms of a rigorous regret bound of 𝒪(1/T) for step sizes {η_t}_t=1^T. Our IRL method was applied on top of an adversarial framework, and it outperformed existing adversarial methods in an extensive suite of benchmarks.

READ FULL TEXT
research
11/09/2020

f-IRL: Inverse Reinforcement Learning via State Marginal Matching

Imitation learning is well-suited for robotic tasks where it is difficul...
research
12/09/2018

Dialogue Generation: From Imitation Learning to Inverse Reinforcement Learning

The performance of adversarial dialogue generation models relies on the ...
research
06/14/2023

Curricular Subgoals for Inverse Reinforcement Learning

Inverse Reinforcement Learning (IRL) aims to reconstruct the reward func...
research
06/23/2021

IQ-Learn: Inverse soft-Q Learning for Imitation

In many sequential decision-making problems (e.g., robotics control, gam...
research
05/17/2023

A proof of imitation of Wasserstein inverse reinforcement learning for multi-objective optimization

We prove Wasserstein inverse reinforcement learning enables the learner'...
research
02/12/2021

Scalable Bayesian Inverse Reinforcement Learning

Bayesian inference over the reward presents an ideal solution to the ill...
research
01/19/2021

Mirror-Descent Inverse Kinematics for Box-constrained Joint Space

This paper proposes a new Jacobian-based inverse kinematics (IK) explici...

Please sign up or login with your details

Forgot password? Click here to reset