Reinforcement Learning Under Algorithmic Triage

09/23/2021
by   Eleni Straitouri, et al.
0

Methods to learn under algorithmic triage have predominantly focused on supervised learning settings where each decision, or prediction, is independent of each other. Under algorithmic triage, a supervised learning model predicts a fraction of the instances and humans predict the remaining ones. In this work, we take a first step towards developing reinforcement learning models that are optimized to operate under algorithmic triage. To this end, we look at the problem through the framework of options and develop a two-stage actor-critic method to learn reinforcement learning models under triage. The first stage performs offline, off-policy training using human data gathered in an environment where the human has operated on their own. The second stage performs on-policy training to account for the impact that switching may have on the human policy, which may be difficult to anticipate from the above human data. Extensive simulation experiments in a synthetic car driving task show that the machine models and the triage policies trained using our two-stage method effectively complement human policies and outperform those provided by several competitive baselines.

READ FULL TEXT
research
01/30/2021

Stay Alive with Many Options: A Reinforcement Learning Approach for Autonomous Navigation

Hierarchical reinforcement learning approaches learn policies based on h...
research
05/17/2021

Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning

Offline Reinforcement Learning promises to learn effective policies from...
research
03/16/2021

Differentiable Learning Under Triage

Multiple lines of evidence suggest that predictive models may benefit fr...
research
07/01/2020

Developing cooperative policies for multi-stage tasks

This paper proposes the Cooperative Soft Actor Critic (CSAC) method of e...
research
02/03/2023

Two-Stage Constrained Actor-Critic for Short Video Recommendation

The wide popularity of short videos on social media poses new opportunit...
research
02/11/2020

Learning to Switch Between Machines and Humans

Reinforcement learning algorithms have been mostly developed and evaluat...
research
10/20/2021

Feedback Linearization of Car Dynamics for Racing via Reinforcement Learning

Through the method of Learning Feedback Linearization, we seek to learn ...

Please sign up or login with your details

Forgot password? Click here to reset