Mridul Agarwal

research

∙ 09/13/2021

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach

Reinforcement learning is widely used in applications where one needs to...

0 Qinbo Bai, et al. ∙

research

∙ 09/12/2021

Concave Utility Reinforcement Learning with Zero-Constraint Violations

We consider the problem of tabular infinite horizon concave utility rein...

0 Mridul Agarwal, et al. ∙

research

∙ 09/09/2021

On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC)

Mean field control (MFC) is an effective way to mitigate the curse of di...

0 Washim Uddin Mondal, et al. ∙

research

∙ 06/12/2021

Markov Decision Processes with Long-Term Average Constraints

We consider the problem of constrained Markov Decision Process (CMDP) wh...

0 Mridul Agarwal, et al. ∙

research

∙ 05/28/2021

Joint Optimization of Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm

Many engineering problems have multiple objectives, and the overall aim ...

0 Qinbo Bai, et al. ∙

research

∙ 02/22/2021

Communication Efficient Parallel Reinforcement Learning

We consider the problem where M agents interact with M identical and ind...

0 Mridul Agarwal, et al. ∙

research

∙ 02/10/2021

Multi-Agent Multi-Armed Bandits with Limited Communication

We consider the problem where N agents collaboratively interact with an ...

0 Mridul Agarwal, et al. ∙

research

∙ 11/16/2020

Blind Decision Making: Reinforcement Learning with Delayed Observations

Reinforcement learning typically assumes that the state update from the ...

0 Mridul Agarwal, et al. ∙

research

∙ 11/16/2020

DART: aDaptive Accept RejecT for non-linear top-K subset identification

We consider the bandit problem of selecting K out of N arms at each time...

0 Mridul Agarwal, et al. ∙

research

∙ 10/03/2019

Escaping Saddle Points for Zeroth-order Nonconvex Optimization using Estimated Gradient Descent

Gradient descent and its variants are widely used in machine learning. H...

0 Qinbo Bai, et al. ∙

research

∙ 09/06/2019

Encoders and Decoders for Quantum Expander Codes Using Machine Learning

Quantum key distribution (QKD) allows two distant parties to share encry...

0 Sathwik Chadaga, et al. ∙

research

∙ 09/06/2019

A Reinforcement Learning Based Approach for Joint Multi-Agent Decision Making

Reinforcement Learning (RL) is being increasingly applied to optimize co...

0 Mridul Agarwal, et al. ∙

research

∙ 11/29/2018

Regret Bounds for Stochastic Combinatorial Multi-Armed Bandits with Linear Space Complexity

Many real-world problems face the dilemma of choosing best K out of N op...

0 Mridul Agarwal, et al. ∙

Mridul Agarwal

Featured Co-authors

Sign in with Google

Consider DeepAI Pro