Interaction-limited Inverse Reinforcement Learning

07/01/2020
by   martin-troussard, et al.
0

This paper proposes an inverse reinforcement learning (IRL) framework to accelerate learning when the learner-teacher interaction is limited during training. Our setting is motivated by the realistic scenarios where a helpful teacher is not available or when the teacher cannot access the learning dynamics of the student. We present two different training strategies: Curriculum Inverse Reinforcement Learning (CIRL) covering the teacher's perspective, and Self-Paced Inverse Reinforcement Learning (SPIRL) focusing on the learner's perspective. Using experiments in simulations and experiments with a real robot learning a task from a human demonstrator, we show that our training strategies can allow a faster training than a random teacher for CIRL and than a batch learner for SPIRL.

READ FULL TEXT

page 12

page 13

research
05/28/2019

Interactive Teaching Algorithms for Inverse Reinforcement Learning

We study the problem of inverse reinforcement learning (IRL) with the ad...
research
09/19/2021

A Conformal Mapping-based Framework for Robot-to-Robot and Sim-to-Real Transfer Learning

This paper presents a novel method for transferring motion planning and ...
research
09/18/2019

On the Right Path: A Modal Logic for Supervised Learning

Formal learning theory formalizes the process of inferring a general res...
research
05/27/2022

Learning to Find Proofs and Theorems by Learning to Refine Search Strategies

We propose a new approach to automated theorem proving and deductive pro...
research
07/31/2021

Inverse Reinforcement Learning for Strategy Identification

In adversarial environments, one side could gain an advantage by identif...
research
11/29/2020

A Q-values Sharing Framework for Multiagent Reinforcement Learning under Budget Constraint

In teacher-student framework, a more experienced agent (teacher) helps a...
research
04/04/2023

Optimal Transport for Correctional Learning

The contribution of this paper is a generalized formulation of correctio...

Please sign up or login with your details

Forgot password? Click here to reset