In real-world multi-agent reinforcement learning (MARL) applications, ag...
Offline reinforcement learning aims to find the optimal policy from a
pr...
Robust Markov decision processes (MDPs) address the challenge of model
u...
In robust Markov decision processes (MDPs), the uncertainty in the trans...
The problem of robust binary hypothesis testing is studied. Under both
h...
Secure multi-party computation have seen substantial performance improve...
Electric vehicles (EVs) play critical roles in autonomous mobility-on-de...
Constrained reinforcement learning is to maximize the expected reward su...
This paper develops the first policy gradient method with global optimal...
The problem of robust hypothesis testing is studied, where under the nul...
The problem of constrained Markov decision process (CMDP) is investigate...
Robust reinforcement learning (RL) is to find a policy that optimizes th...
Actor-critic (AC) algorithms have been widely adopted in decentralized
m...
Online detection of changes in stochastic systems, referred to as sequen...
Temporal-difference learning with gradient correction (TDC) is a two
tim...
Greedy-GQ is a value-based reinforcement learning (RL) algorithm for opt...
The first provably efficient algorithm for learning graph neural network...
Variance reduction techniques have been successfully applied to
temporal...
Greedy-GQ is an off-policy two timescale algorithm for optimal control i...
The quality of images captured by wireless capsule endoscopy (WCE) is ke...
Gradient-based temporal difference (GTD) algorithms are widely used in
o...
Though the convergence of major reinforcement learning algorithms has be...
We show that model compression can improve the population risk of a
pre-...
A mutual information based upper bound on the generalization error of a
...
The problem of quickest detection of dynamic events in networks is studi...
The problem of quickest change detection (QCD) under transient dynamics ...
The problem of universal outlying sequence detection is studied, where t...
Nonparametric detection of existence of an anomalous structure over a ne...
A nonparametric anomalous hypothesis testing problem is investigated, in...
The nonparametric problem of detecting existence of an anomalous interva...