Meta-Adversarial Inverse Reinforcement Learning for Decision-making Tasks

03/23/2021
by   Pin Wang, et al.
0

Learning from demonstrations has made great progress over the past few years. However, it is generally data hungry and task specific. In other words, it requires a large amount of data to train a decent model on a particular task, and the model often fails to generalize to new tasks that have a different distribution. In practice, demonstrations from new tasks will be continuously observed and the data might be unlabeled or only partially labeled. Therefore, it is desirable for the trained model to adapt to new tasks that have limited data samples available. In this work, we build an adaptable imitation learning model based on the integration of Meta-learning and Adversarial Inverse Reinforcement Learning (Meta-AIRL). We exploit the adversarial learning and inverse reinforcement learning mechanisms to learn policies and reward functions simultaneously from available training tasks and then adapt them to new tasks with the meta-learning framework. Simulation results show that the adapted policy trained with Meta-AIRL can effectively learn from limited number of demonstrations, and quickly reach the performance comparable to that of the experts on unseen tasks.

READ FULL TEXT
research
06/07/2019

Watch, Try, Learn: Meta-Learning from Demonstrations and Reward

Imitation learning allows agents to learn complex behaviors from demonst...
research
03/29/2020

When Autonomous Systems Meet Accuracy and Transferability through AI: A Survey

With widespread applications of artificial intelligence (AI), the capabi...
research
11/02/2020

NEARL: Non-Explicit Action Reinforcement Learning for Robotic Control

Traditionally, reinforcement learning methods predict the next action ba...
research
10/18/2020

Model-Based Inverse Reinforcement Learning from Visual Demonstrations

Scaling model-based inverse reinforcement learning (IRL) to real robotic...
research
06/18/2011

Bayesian multitask inverse reinforcement learning

We generalise the problem of inverse reinforcement learning to multiple ...
research
11/23/2019

Meta Adaptation using Importance Weighted Demonstrations

Imitation learning has gained immense popularity because of its high sam...
research
07/14/2021

Deep Adaptive Multi-Intention Inverse Reinforcement Learning

This paper presents a deep Inverse Reinforcement Learning (IRL) framewor...

Please sign up or login with your details

Forgot password? Click here to reset