Reinforcement learning (RL) has shown promise in creating robust policie...
We study the variance of the REINFORCE policy gradient estimator in
envi...
We propose a method to maintain high resource in a networked heterogeneo...
Quadrotor stabilizing controllers often require careful, model-specific
...