-
P3O: Policy-on Policy-off Policy Optimization
On-policy reinforcement learning (RL) algorithms have high sample comple...
05/05/2019 ∙ by Rasool Fakoor, et al. ∙36 ∙
share
read it
-
Meta-Q-Learning
This paper introduces Meta-Q-Learning (MQL), a new off-policy algorithm ...
09/30/2019 ∙ by Rasool Fakoor, et al. ∙26 ∙
share
read it
-
Memory-augmented Attention Modelling for Videos
We present a method to improve video description generation by modeling ...
11/07/2016 ∙ by Rasool Fakoor, et al. ∙0 ∙
share
read it
-
Constrained Convolutional-Recurrent Networks to Improve Speech Quality with Low Impact on Recognition Accuracy
For a speech-enhancement algorithm, it is highly desirable to simultaneo...
02/16/2018 ∙ by Rasool Fakoor, et al. ∙0 ∙
share
read it
-
Differentiable Greedy Networks
Optimal selection of a subset of items from a given set is a hard proble...
10/30/2018 ∙ by Thomas Powers, et al. ∙0 ∙
share
read it
-
Direct optimization of F-measure for retrieval-based personal question answering
Recent advances in spoken language technologies and the introduction of ...
09/28/2018 ∙ by Rasool Fakoor, et al. ∙0 ∙
share
read it

Rasool Fakoor
is this you? claim profile