Active Imitation Learning via Reduction to I.I.D. Active Learning

10/16/2012
by   Kshitij Judah, et al.
0

In standard passive imitation learning, the goal is to learn a target policy by passively observing full execution trajectories of it. Unfortunately, generating such trajectories can require substantial expert effort and be impractical in some cases. In this paper, we consider active imitation learning with the goal of reducing this effort by querying the expert about the desired action at individual states, which are selected based on answers to past queries and the learner's interactions with an environment simulator. We introduce a new approach based on reducing active imitation learning to i.i.d. active learning, which can leverage progress in the i.i.d. setting. Our first contribution, is to analyze reductions for both non-stationary and stationary policies, showing that the label complexity (number of queries) of active imitation learning can be substantially less than passive learning. Our second contribution, is to introduce a practical algorithm inspired by the reductions, which is shown to be highly effective in four test domains compared to a number of alternatives.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/26/2020

Active Imitation Learning with Noisy Guidance

Imitation learning algorithms provide state-of-the-art results on many s...
research
06/14/2020

Active Imitation Learning from Multiple Non-Deterministic Teachers: Formulation, Challenges, and Algorithms

We formulate the problem of learning to imitate multiple, non-determinis...
research
06/18/2019

RadGrad: Active learning with loss gradients

Solving sequential decision prediction problems, including those in imit...
research
02/17/2021

Fully General Online Imitation Learning

In imitation learning, imitators and demonstrators are policies for pick...
research
07/01/2019

Active Learning within Constrained Environments through Imitation of an Expert Questioner

Active learning agents typically employ a query selection algorithm whic...
research
07/20/2020

Modelling, Simulation, and Planning for the MoleMOD System

MoleMOD is a heterogeneous self-reconfigurable modular robotic system to...
research
07/09/2020

IALE: Imitating Active Learner Ensembles

Active learning (AL) prioritizes the labeling of the most informative da...

Please sign up or login with your details

Forgot password? Click here to reset