Coagent networks for reinforcement learning (RL) [Thomas and Barto, 2011...
Robust Markov Decision Processes (MDPs) are getting more attention for
l...
Many real-world sequential decision-making problems involve critical sys...
Performance evaluations are critical for quantifying algorithmic advance...
We propose a new objective function for finite-horizon episodic Markov
d...
With the rise of neural models across the field of information retrieval...