Zero-sum Linear Quadratic (LQ) games are fundamental in optimal control ...
We explore the problem of imitation learning (IL) in the context of
mean...
Trust-region methods based on Kullback-Leibler divergence are pervasivel...
Multi-agent reinforcement learning (MARL) addresses sequential
decision-...
Constrained Markov Decision Processes (CMDPs) are one of the common ways...
We consider the reinforcement learning (RL) problem with general utiliti...
In this paper, we study the statistical efficiency of Reinforcement Lear...
We study policy evaluation of offline contextual bandits subject to
unob...
In this paper, we study the Tiered Reinforcement Learning setting, a par...
Recently, the impressive empirical success of policy gradient (PG) metho...
Mean-field games have been used as a theoretical tool to obtain an
appro...
In real-world decision-making, uncertainty is important yet difficult to...
Natural actor-critic (NAC) and its variants, equipped with the represent...
We study differentially private (DP) algorithms for smooth stochastic mi...
This paper studies the uniform convergence and generalization bounds for...
We study the performance of Stochastic Cubic Regularized Newton (SCRN) o...
The variance reduced gradient estimators for policy gradient methods has...
We consider the reinforcement learning problem for partially observed Ma...
We study the bilinearly coupled minimax problem: min_xmax_y f(x) +
y^⊤ A...
Gradient descent ascent (GDA), the simplest single-loop algorithm for
no...
Natural policy gradient (NPG) methods with function approximation achiev...
This paper studies the complexity for finding approximate stationary poi...
The building sector consumes the largest energy in the world, and there ...
We study the dynamics of temporal-difference learning with neural
networ...
In this paper, we establish a theoretical comparison between the asympto...
Conditional Stochastic Optimization (CSO) covers a variety of applicatio...
The use of target networks is a common practice in deep reinforcement
le...
Nonconvex minimax problems appear frequently in emerging machine learnin...
In this paper, we introduce a unified framework for analyzing a large fa...
This article reviews recent advances in multi-agent reinforcement learni...
In this paper, we study a class of stochastic optimization problems, ref...
We present an efficient algorithm for maximum likelihood estimation (MLE...
The use of target networks has been a popular and key component of recen...
We introduce a new convex optimization problem, termed quadratic decompo...
We investigate penalized maximum log-likelihood estimation for exponenti...
We introduce a new convex optimization problem, termed quadratic decompo...
In this paper, we design a nonparametric online algorithm for estimating...
We revisit the Bellman optimality equation with Nesterov's smoothing
tec...
This paper proposes a new actor-critic-style algorithm called Dual
Actor...
Learning-based binary hashing has become a powerful paradigm for fast se...
Poisson likelihood models have been prevalently used in imaging, social
...
Many machine learning tasks, such as learning with invariance and policy...
Bayesian methods are appealing in their flexibility in modeling complex ...
The general perception is that kernel methods are not scalable, and neur...
We present a stochastic setting for optimization problems with nonsmooth...