
On Wasserstein Reinforcement Learning and the Fokker-Planck equation
Policy gradient methods often achieve better performance when the chang...

A short variational proof of equivalence between policy gradients and soft Q learning
Two main families of reinforcement learning algorithms, Q-learning and p...

Combining learning rate decay and weight decay with complexity gradient descent - Part I
The role of L^2 regularization, in the specific case of deep neural netw...

Static Activation Function Normalization
Recent seminal work at the intersection of deep neural networks practice...

Biologically inspired architectures for sample-efficient deep reinforcement learning
Deep reinforcement learning requires a heavy price in terms of sample ef...
Pierre H. Richemond