Transfer Learning Across Patient Variations with Hidden Parameter Markov Decision Processes

12/01/2016
by   Taylor Killian, et al.
0

Due to physiological variation, patients diagnosed with the same condition may exhibit divergent, but related, responses to the same treatments. Hidden Parameter Markov Decision Processes (HiP-MDPs) tackle this transfer-learning problem by embedding these tasks into a low-dimensional space. However, the original formulation of HiP-MDP had a critical flaw: the embedding uncertainty was modeled independently of the agent's state uncertainty, requiring an unnatural training procedure in which all tasks visited every part of the state space---possible for robots that can be moved to a particular location, impossible for human patients. We update the HiP-MDP framework and extend it to more robustly develop personalized medicine strategies for HIV treatment.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/20/2017

Robust and Efficient Transfer Learning with Hidden-Parameter Markov Decision Processes

We introduce a new formulation of the Hidden Parameter Markov Decision P...
research
08/15/2013

Hidden Parameter Markov Decision Processes: A Semiparametric Regression Approach for Discovering Latent Task Parametrizations

Control applications often feature tasks with similar, but not identical...
research
10/21/2016

Learning Cost-Effective Treatment Regimes using Markov Decision Processes

Decision makers, such as doctors and judges, make crucial decisions such...
research
11/15/2017

Markov Decision Processes with Continuous Side Information

We consider a reinforcement learning (RL) setting in which the agent int...
research
03/12/2021

On Incorporating Forecasts into Linear State Space Model Markov Decision Processes

Weather forecast information will very likely find increasing applicatio...
research
03/27/2013

Problem Formulation as the Reduction of a Decision Model

In this paper, we extend the QMRDT probabilistic model for the domain of...
research
10/08/2020

Adaptive Shielding under Uncertainty

This paper targets control problems that exhibit specific safety and per...

Please sign up or login with your details

Forgot password? Click here to reset