Mehdi Jafarnia-Jahromi

research

∙ 09/08/2021

Learning Zero-sum Stochastic Games with Posterior Sampling

In this paper, we propose Posterior Sampling Reinforcement Learning for ...

0 Mehdi Jafarnia-Jahromi, et al. ∙

research

∙ 09/07/2021

Online Learning for Cooperative Multi-Player Multi-Armed Bandits

We introduce a framework for decentralized online learning for multi-arm...

0 William Chang, et al. ∙

research

∙ 06/15/2021

Implicit Finite-Horizon Approximation and Efficient Optimal Algorithms for Stochastic Shortest Path

We introduce a generic template for developing regret minimization algor...

0 Liyu Chen, et al. ∙

research

∙ 06/09/2021

Online Learning for Stochastic Shortest Path Model via Posterior Sampling

We consider the problem of online reinforcement learning for the Stochas...

0 Mehdi Jafarnia-Jahromi, et al. ∙

research

∙ 02/25/2021

Online Learning for Unknown Partially Observable MDPs

Solving Partially Observable Markov Decision Processes (POMDPs) is hard....

0 Mehdi Jafarnia-Jahromi, et al. ∙

research

∙ 07/23/2020

Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation

We develop several new algorithms for learning Markov Decision Processes...

12 Chen-Yu Wei, et al. ∙

research

∙ 06/08/2020

A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret

Recently, model-free reinforcement learning has attracted research atten...

12 Mehdi Jafarnia-Jahromi, et al. ∙

research

∙ 10/15/2019

Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes

Model-free reinforcement learning is known to be memory and computation ...

0 Chen-Yu Wei, et al. ∙

research

∙ 12/25/2018

PPD: Permutation Phase Defense Against Adversarial Examples in Deep Learning

Deep neural networks have demonstrated cutting edge performance on vario...

12 Mehdi Jafarnia-Jahromi, et al. ∙

Mehdi Jafarnia-Jahromi

Featured Co-authors

Sign in with Google

Consider DeepAI Pro