research ∙ 04/05/2022
Configuration Path Control
Reinforcement learning methods often produce brittle policies – policies...
research ∙ 06/20/2021
Three-dimensional bipedal model with zero-energy-cost walking
We study a three-dimensional articulated rigid-body biped model that pos...
research ∙ 11/15/2018
Reward-estimation variance elimination in sequential decision processes
Policy gradient methods are very attractive in reinforcement learning du...
research ∙ 10/01/2011