In model-based reinforcement learning, planning with an imperfect model ...
Temporal difference methods enable efficient estimation of value functio...
Distribution and sample models are two popular model choices in model-ba...
Model-based strategies for control are critical to obtain sample efficie...