Active Imitation Learning with Noisy Guidance

05/26/2020
by   Kianté Brantley, et al.
9

Imitation learning algorithms provide state-of-the-art results on many structured prediction tasks by learning near-optimal search policies. Such algorithms assume training-time access to an expert that can provide the optimal action at any queried state; unfortunately, the number of such queries is often prohibitive, frequently rendering these approaches impractical. To combat this query complexity, we consider an active learning setting in which the learning algorithm has additional access to a much cheaper noisy heuristic that provides noisy guidance. Our algorithm, LEAQI, learns a difference classifier that predicts when the expert is likely to disagree with the heuristic, and queries the expert only when necessary. We apply LEAQI to three sequence labeling tasks, demonstrating significantly fewer queries to the expert and comparable (or better) accuracies over a passive approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/16/2012

Active Imitation Learning via Reduction to I.I.D. Active Learning

In standard passive imitation learning, the goal is to learn a target po...
research
01/03/2023

DADAgger: Disagreement-Augmented Dataset Aggregation

DAgger is an imitation algorithm that aggregates its original datasets b...
research
03/11/2016

Near-Optimal Active Learning of Halfspaces via Query Synthesis in the Noisy Setting

In this paper, we consider the problem of actively learning a linear cla...
research
06/18/2019

RadGrad: Active learning with loss gradients

Solving sequential decision prediction problems, including those in imit...
research
07/09/2020

IALE: Imitating Active Learner Ensembles

Active learning (AL) prioritizes the labeling of the most informative da...
research
02/17/2021

Fully General Online Imitation Learning

In imitation learning, imitators and demonstrators are policies for pick...
research
07/11/2023

Selective Sampling and Imitation Learning via Online Regression

We consider the problem of Imitation Learning (IL) by actively querying ...

Please sign up or login with your details

Forgot password? Click here to reset