-
Randomized Ensembled Double Q-Learning: Learning Fast Without a Model
Using a high Update-To-Data (UTD) ratio, model-based methods have recent...
read it
-
On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning
A simple and natural algorithm for reinforcement learning is Monte Carlo...
read it
-
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
The field of Deep Reinforcement Learning (DRL) has recently seen a surge...
read it
-
Towards Simplicity in Deep Reinforcement Learning: Streamlined Off-Policy Learning
The field of Deep Reinforcement Learning (DRL) has recently seen a surge...
read it
-
Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past
Soft Actor-Critic (SAC) is an off-policy actor-critic deep reinforcement...
read it

Che Wang
is this you? claim profile