We study the action generalization ability of deep Q-learning in discret...
Principled decision-making in continuous state–action spaces is impossib...
Reinforcement learning is hard in general. Yet, in many specific environ...
The fundamental assumption of reinforcement learning in Markov decision ...
The difficulty of classical planning increases exponentially with search...
We propose a new algorithm, Mean Actor-Critic (MAC), for discrete-action...