Algorithms based on the entropy regularized framework, such as Soft Q-le...
This manuscript introduces the idea of using Distributionally Robust Opt...
Generalization to unknown/uncertain environments of reinforcement learni...
In many online applications, interactions between a user and a web-servic...
Recommender systems objectives can be broadly characterized as modeling ...
We propose Meta-Prod2vec, a novel method to compute item similarities fo...