
Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes
We present new policy mirror descent (PMD) methods for solving reinforce...
Simple and optimal methods for stochastic variational inequalities, II: Markovian noise and policy evaluation in reinforcement learning
The focus of this paper is on stochastic variational inequalities (VI) u...
A Primal Approach to Constrained Policy Optimization: Global Optimality and Finite-Time Analysis
Safe reinforcement learning (SRL) problems are typically modeled as cons...
Simple and optimal methods for stochastic variational inequalities, I: operator extrapolation
In this paper we first present a novel operator extrapolation (OE) metho...
A Feasible Level Proximal Point Method for Nonconvex Sparse Constrained Optimization
Nonconvex sparse models have received significant attention in high-dime...
Conditional Gradient Methods for convex optimization with function constraints
Conditional gradient methods have attracted much attention in both machi...
A Unified Single-loop Alternating Gradient Projection Algorithm for Nonconvex-Concave and Convex-Nonconcave Minimax Problems
Much recent research effort has been directed to the development of effi...
Complexity of Stochastic Dual Dynamic Programming
Stochastic dual dynamic programming is a cutting plane type algorithm fo...
Proximal Point Methods for Optimization with Nonconvex Functional Constraints
Nonconvex optimization is becoming more and more important in machine le...
GLAD: Learning Sparse Graph Recovery
Recovering sparse conditional independence graphs from data is a fundame...
A unified variance-reduced accelerated gradient method for convex optimization
We propose a novel randomized incremental gradient algorithm, namely, VA...
Cubic Regularization with Momentum for Nonconvex Optimization
Momentum is a popular technique to accelerate the convergence in practic...
Optimal Adaptive and Accelerated Stochastic Gradient Descent
Stochastic gradient descent (SGD) methods are the most powerful optimiza...
Complexity of Training ReLU Neural Network
In this paper, we explore some basic questions on the complexity of trai...
Asynchronous decentralized accelerated stochastic gradient descent
In this work, we introduce an asynchronous decentralized accelerated sto...
A Note on Inexact Condition for Cubic Regularized Newton's Method
This note considers the inexact cubic-regularized Newton's method (CR), ...
Sample Complexity of Stochastic Variance-Reduced Cubic Regularization for Nonconvex Optimization
The popular cubic regularization (CR) method converges with first and s...
Random gradient extrapolation for distributed and stochastic optimization
In this paper, we consider a class of finite-sum convex optimization pro...
Dynamic Stochastic Approximation for Multistage Stochastic Optimization
In this paper, we consider multistage stochastic optimization problems ...
Conditional Accelerated Lazy Stochastic Gradient Descent
In this work we introduce a conditional accelerated lazy stochastic grad...
Algorithms for stochastic optimization with expectation constraints
This paper considers the problem of minimizing an expectation function o...
Generalized Uniformly Optimal Methods for Nonlinear Programming
In this paper, we present a generic framework to extend existing uniform...
An optimal randomized incremental gradient method
In this paper, we consider a class of finite-sum convex optimization pro...
Stochastic First- and Zeroth-order Methods for Nonconvex Stochastic Programming
In this paper, we introduce a new stochastic approximation (SA) type alg...
Guanghui Lan
