Interactive Learning from Activity Description

02/13/2021
by   Khanh Nguyen, et al.
6

We present a novel interactive learning protocol that enables training request-fulfilling agents by verbally describing their activities. Our protocol gives rise to a new family of interactive learning algorithms that offer complementary advantages against traditional algorithms like imitation learning (IL) and reinforcement learning (RL). We develop an algorithm that practically implements this protocol and employ it to train agents in two challenging request-fulfilling problems using purely language-description feedback. Empirical results demonstrate the strengths of our algorithm: compared to RL baselines, it is more sample-efficient; compared to IL baselines, it achieves competitive success rates while not requiring feedback providers to have agent-specific expertise. We also provide theoretical guarantees of the algorithm under certain assumptions on the teacher and the environment.

READ FULL TEXT

page 2

page 31

page 36

research
06/09/2021

PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training

Conveying complex objectives to reinforcement learning (RL) agents can o...
research
08/02/2020

Interactive Imitation Learning in State-Space

Imitation Learning techniques enable programming the behavior of agents ...
research
06/23/2014

Reinforcement and Imitation Learning via Interactive No-Regret Learning

Recent work has demonstrated that problems-- particularly imitation lear...
research
09/08/2022

Improved Robust Algorithms for Learning with Discriminative Feature Feedback

Discriminative Feature Feedback is a setting proposed by Dastupta et al....
research
12/02/2021

Towards Intrinsic Interactive Reinforcement Learning

Reinforcement learning (RL) and brain-computer interfaces (BCI) are two ...
research
05/30/2019

On Value Functions and the Agent-Environment Boundary

When function approximation is deployed in reinforcement learning (RL), ...
research
02/06/2023

A Strong Baseline for Batch Imitation Learning

Imitation of expert behaviour is a highly desirable and safe approach to...

Please sign up or login with your details

Forgot password? Click here to reset