research
∙
12/22/2017
A short variational proof of equivalence between policy gradients and soft Q learning
Two main families of reinforcement learning algorithms, Q-learning and p...
research
∙
12/19/2017