research
          
      
      ∙
      04/05/2022
    Configuration Path Control
Reinforcement learning methods often produce brittle policies – policies...
          
            research
          
      
      ∙
      06/20/2021
    Three-dimensional bipedal model with zero-energy-cost walking
We study a three-dimensional articulated rigid-body biped model that pos...
          
            research
          
      
      ∙
      11/15/2018
    Reward-estimation variance elimination in sequential decision processes
Policy gradient methods are very attractive in reinforcement learning du...
          
            research
          
      
      ∙
      10/01/2011