-
Optimistic Policy Iteration for MDPs with Acyclic Transient State Structure
We consider Markov Decision Processes (MDPs) in which every stationary p...
read it
-
Combining Reinforcement Learning with Model Predictive Control for On-Ramp Merging
We consider the problem of designing an algorithm to allow a car to auto...
read it

Joseph Lubars
is this you? claim profile