Teaching an Active Learner with Contrastive Examples

10/28/2021
by   Chaoqi Wang, et al.
9

We study the problem of active learning with the added twist that the learner is assisted by a helpful teacher. We consider the following natural interaction protocol: At each round, the learner proposes a query asking for the label of an instance x^q, the teacher provides the requested label {x^q, y^q} along with explanatory information to guide the learning process. In this paper, we view this information in the form of an additional contrastive example ({x^c, y^c}) where x^c is picked from a set constrained by x^q (e.g., dissimilar instances with the same label). Our focus is to design a teaching algorithm that can provide an informative sequence of contrastive examples to the learner to speed up the learning process. We show that this leads to a challenging sequence optimization problem where the algorithm's choices at a given round depend on the history of interactions. We investigate an efficient teaching algorithm that adaptively picks these contrastive examples. We derive strong performance guarantees for our algorithm based on two problem-dependent parameters and further show that for specific types of active learners (e.g., a generalized binary search learner), the proposed teaching algorithm exhibits strong approximation guarantees. Finally, we illustrate our bounds and demonstrate the effectiveness of our teaching framework via two numerical case studies.

READ FULL TEXT
research
05/28/2019

Interactive Teaching Algorithms for Inverse Reinforcement Learning

We study the problem of inverse reinforcement learning (IRL) with the ad...
research
05/21/2018

Teaching Multiple Concepts to Forgetful Learners

How can we help a forgetful learner learn multiple concepts within a lim...
research
02/20/2018

Teaching Categories to Human Learners with Visual Explanations

We study the problem of computer-assisted teaching with explanations. Co...
research
04/08/2019

Ensemble Teaching for Hybrid Label Propagation

Label propagation aims to iteratively diffuse the label information from...
research
06/21/2021

Optimizing piano practice with a utility-based scaffold

A typical part of learning to play the piano is the progression through ...
research
06/30/2022

When an Active Learner Meets a Black-box Teacher

Active learning maximizes the hypothesis updates to find those desired u...
research
10/01/2018

Simple Algorithms for Learning from Random Counterexamples

This work describes two simple and efficient algorithms for exactly lear...

Please sign up or login with your details

Forgot password? Click here to reset