Off-policy evaluation (OPE) aims to estimate the benefit of following a
...
Many reinforcement learning (RL) applications have combinatorial action
...
A collection of the extended abstracts that were presented at the 2nd Ma...
Modern decision-making systems, from robots to web recommendation engine...
Reinforcement learning (RL) can be used to learn treatment policies and ...
Standard reinforcement learning (RL) aims to find an optimal policy that...
Recurrent neural networks (RNNs) are commonly applied to clinical time-s...