research
∙
08/28/2022
Normality-Guided Distributional Reinforcement Learning for Continuous Control
Learning a predictive model of the mean return, or value function, plays...
research
∙
10/08/2021
Training Transition Policies via Distribution Matching for Complex Tasks
Humans decompose novel complex tasks into simpler ones to exploit previo...
research
∙
10/20/2020