Off-Dynamics Inverse Reinforcement Learning from Hetero-Domain

10/21/2021
by   Yachen Kang, et al.
0

We propose an approach for inverse reinforcement learning from hetero-domain which learns a reward function in the simulator, drawing on the demonstrations from the real world. The intuition behind the method is that the reward function should not only be oriented to imitate the experts, but should encourage actions adjusted for the dynamics difference between the simulator and the real world. To achieve this, the widely used GAN-inspired IRL method is adopted, and its discriminator, recognizing policy-generating trajectories, is modified with the quantification of dynamics difference. The training process of the discriminator can yield the transferable reward function suitable for simulator dynamics, which can be guaranteed by derivation. Effectively, our method assigns higher rewards for demonstration trajectories which do not exploit discrepancies between the two domains. With extensive experiments on continuous control tasks, our method shows its effectiveness and demonstrates its scalability to high-dimensional tasks.

READ FULL TEXT

page 4

page 5

page 6

research
05/31/2018

Learning a Prior over Intent via Meta-Inverse Reinforcement Learning

A significant challenge for the practical application of reinforcement l...
research
02/01/2023

Internally Rewarded Reinforcement Learning

We study a class of reinforcement learning problems where the reward sig...
research
09/25/2022

Reward Learning using Structural Motifs in Inverse Reinforcement Learning

The Inverse Reinforcement Learning (IRL) problem has seen rapid evolutio...
research
06/24/2020

Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers

We propose a simple, practical, and intuitive approach for domain adapta...
research
12/12/2019

Improved Activity Forecasting for Generating Trajectories

An efficient inverse reinforcement learning for generating trajectories ...
research
10/20/2022

The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning

Deep Reinforcement Learning (DRL) has achieved remarkable success in sce...
research
05/20/2021

Objective-aware Traffic Simulation via Inverse Reinforcement Learning

Traffic simulators act as an essential component in the operating and pl...

Please sign up or login with your details

Forgot password? Click here to reset