Using Machine Teaching to Investigate Human Assumptions when Teaching Reinforcement Learners

09/05/2020
by Yun-Shiuan Chuang, et al.

Successful teaching requires an assumption about how the learner learns: how the learner uses experiences from the world to update its internal states. We investigate what expectations people have about a learner when they teach it online using rewards and punishments. We focus on a common reinforcement learning method, Q-learning, and examine people's assumptions in a behavioral experiment. To do so, we first establish a normative standard by formulating the task as a machine teaching optimization problem. To solve this problem, we use a deep learning approximation method that simulates learners in the environment and learns to predict how feedback affects the learner's internal states. What do people assume about a learner's learning and discount rates when they teach it an idealized exploration-exploitation task? In a behavioral experiment, we find that people can teach the task to Q-learners relatively efficiently and effectively when the learner uses a small discount rate and a large learning rate. However, their teaching is still suboptimal relative to the normative standard. We also find that showing people real-time updates of how possible feedback would affect the Q-learner's internal states helps them teach only weakly. Our results reveal how people teach using evaluative feedback and provide guidance for how engineers should design machine agents in a manner that is intuitive for people.
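To make the roles of the learning rate and the discount rate concrete, here is a minimal sketch of the standard tabular Q-learning update. It is not the paper's implementation: the names `q_update` and `preview_feedback`, the dictionary-based Q-table, and the toy states, actions, and reward values are all assumptions chosen for illustration. The preview function only mimics the idea of showing a teacher how each possible feedback value would change the learner's internal state.

```python
from typing import Dict, Iterable, Sequence, Tuple

State = int
Action = int
QTable = Dict[Tuple[State, Action], float]


def q_update(
    q: QTable,
    state: State,
    action: Action,
    reward: float,
    next_state: State,
    actions: Sequence[Action],
    alpha: float,
    gamma: float,
) -> float:
    """Apply one standard Q-learning update in place and return the new value.

    alpha is the learning rate, gamma is the discount rate.
    Unseen (state, action) pairs are treated as having value 0.0 (an assumption).
    """
    best_next = max(q.get((next_state, a), 0.0) for a in actions)
    td_target = reward + gamma * best_next
    old_value = q.get((state, action), 0.0)
    new_value = old_value + alpha * (td_target - old_value)
    q[(state, action)] = new_value
    return new_value


def preview_feedback(
    q: QTable,
    state: State,
    action: Action,
    next_state: State,
    actions: Sequence[Action],
    candidate_rewards: Iterable[float],
    alpha: float,
    gamma: float,
) -> Dict[float, float]:
    """Hypothetical helper: report the Q-value each candidate reward would
    produce, without modifying the learner's actual Q-table."""
    preview = {}
    for reward in candidate_rewards:
        scratch = dict(q)  # work on a copy so the real table is untouched
        preview[reward] = q_update(
            scratch, state, action, reward, next_state, actions, alpha, gamma
        )
    return preview


if __name__ == "__main__":
    actions = [0, 1]
    # Large learning rate, small discount rate.
    fast_learner: QTable = {}
    print(q_update(fast_learner, 0, 1, 1.0, 1, actions, alpha=0.9, gamma=0.1))
    # Small learning rate: the same reward moves the estimate much less.
    slow_learner: QTable = {}
    print(q_update(slow_learner, 0, 1, 1.0, 1, actions, alpha=0.1, gamma=0.1))
    # What would a punishment or a reward do to the fast learner next?
    print(preview_feedback(fast_learner, 0, 1, 1, actions, [-1.0, 1.0], 0.9, 0.1))
```

With a large learning rate, a single piece of feedback moves the learner's estimate most of the way to the target, and with a small discount rate that target depends mostly on the immediate reward rather than on estimated future value. In this sketch, that is the setting in which each piece of feedback has the most direct and predictable effect on the learner.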


