- FiDi-RL: Incorporating Deep Reinforcement Learning with Finite-Difference Policy Search for Efficient Learning of Continuous Control
In recent years significant progress has been made in dealing with chall...
- TBQ(σ): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning
Off-policy reinforcement learning with eligibility traces is challenging...

Longxiang Shi