Learning to Ask Medical Questions using Reinforcement Learning

03/31/2020
by Uri Shaham, et al.

We propose a novel reinforcement learning-based approach for adaptive and iterative feature selection. Given a masked vector of input features, a reinforcement learning agent iteratively selects features to unmask, and uses them to predict an outcome once it is sufficiently confident. The algorithm makes use of a novel environment setting, corresponding to a non-stationary Markov Decision Process. A key component of our approach is a guesser network, which is trained to predict the outcome from the selected features and which parametrizes the reward function. Applying our method to a national survey dataset, we show that it not only outperforms strong baselines when the prediction must be made from a small number of input features, but is also considerably more interpretable. Our code is publicly available at <https://github.com/ushaham/adaptiveFS>.
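
To make the setting in the abstract concrete, the following is a minimal sketch of one episode of masked-feature selection. It is not taken from the authors' repository: the names `MaskedFeatureEnv` and `toy_guesser` are hypothetical, the guesser is a fixed hand-written function standing in for a trained network, and the reward (the guesser's probability for the true label, paid only when the agent decides to guess) is an assumption chosen for illustration. The abstract states only that the guesser parametrizes the reward.

```python
import numpy as np


class MaskedFeatureEnv:
    """Toy episode: all features start masked.

    Actions 0..d-1 unmask the corresponding feature; action d asks the
    guesser for a prediction and ends the episode.
    """

    def __init__(self, x, y, guesser):
        self.x = np.asarray(x, dtype=float)
        self.y = int(y)
        self.guesser = guesser
        self.d = self.x.shape[0]
        self.mask = np.zeros(self.d)

    def state(self):
        # Revealed values (zero where masked) concatenated with the mask,
        # so a masked feature is distinguishable from a true zero.
        return np.concatenate([self.x * self.mask, self.mask])

    def step(self, action):
        if action == self.d:
            probs = self.guesser(self.state())
            # Assumed reward: guesser probability of the true label.
            return self.state(), float(probs[self.y]), True
        self.mask[action] = 1.0
        return self.state(), 0.0, False


def toy_guesser(state):
    """Hypothetical stand-in for a trained guesser network (binary outcome)."""
    d = state.shape[0] // 2
    values, mask = state[:d], state[d:]
    score = float(np.sum(values * mask))
    p1 = 1.0 / (1.0 + np.exp(-score))
    return np.array([1.0 - p1, p1])


env = MaskedFeatureEnv(x=[2.0, -1.0, 0.5], y=1, guesser=toy_guesser)
state, reward, done = env.step(0)       # unmask feature 0
state, reward, done = env.step(env.d)   # make a guess and end the episode
```

In this sketch the reward depends on whatever function is passed as `guesser`, so if the guesser were updated during training, the same state and action could yield a different reward over time.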
