Brendan Maginnis | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Pierre H. Richemond
12 publications

research

∙ 12/22/2017

A short variational proof of equivalence between policy gradients and soft Q learning

Two main families of reinforcement learning algorithms, Q-learning and p...

0 Pierre H. Richemond, et al. ∙

research

∙ 12/19/2017

On Wasserstein Reinforcement Learning and the Fokker-Planck equation

Policy gradients methods often achieve better performance when the chang...

0 Pierre H. Richemond, et al. ∙

Success!

An error occurred