In this paper, we study a class of stochastic bilevel optimization probl...
Non-asymptotic convergence analysis of quasi-Newton methods has gained
a...
In this paper, we propose an accelerated quasi-Newton proximal extragrad...
Pruning schemes have been widely used in practice to reduce the complexi...
Quasi-Newton algorithms are among the most popular iterative methods for...
The goal of contrasting learning is to learn a representation that prese...
In order to achieve the dual goals of privacy and learning across distri...
One of the key challenges of learning an online recommendation model is ...
In this paper, we study a class of bilevel optimization problems, also k...
The optimistic gradient method has seen increasing popularity as an effi...
We study convergence rates of AdaGrad-Norm as an exemplar of adaptive
st...
Recent empirical evidence has driven conventional wisdom to believe that...
In this paper, we study the application of quasi-Newton methods for solv...
In this paper, we study the generalization properties of Model-Agnostic
...
Model-Agnostic Meta-Learning (MAML) has demonstrated widespread success ...
In federated learning, communication cost is often a critical bottleneck...
Mobile network operators store an enormous amount of information like lo...
In this paper, we study the non-asymptotic superlinear convergence rate ...
We consider a decentralized stochastic learning problem where data point...
The goal of federated learning is to design algorithms in which several
...
We consider Model-Agnostic Meta-Learning (MAML) methods for Reinforcemen...
The Model-Agnostic Meta-Learning (MAML) algorithm <cit.> has
been celebr...
In this paper, we focus on solving a class of constrained non-convex
non...
One of the beauties of the projected gradient descent method lies in its...
Federated learning is a new distributed machine learning approach, where...
In this paper, we study the convergence theory of a class of gradient-ba...
We consider a decentralized learning problem, where a set of computing n...
In this paper we analyze the iteration complexity of the optimistic grad...
In this paper, we develop Stochastic Continuous Greedy++ (SCG++), the fi...
How can we efficiently mitigate the overhead of gradient communications ...
We consider solving convex-concave saddle point problems. We focus on tw...
In this paper, we propose a Distributed Accumulated Newton Conjugate gra...
In this paper, we focus on escaping from saddle points in smooth nonconv...
We consider the problem of decentralized consensus optimization, where t...
Recently, the decentralized optimization problem is attracting growing
a...
We study gradient-based optimization methods obtained by directly
discre...
This paper considers stochastic optimization problems for a large class ...
We consider large scale empirical risk minimization (ERM) problems, wher...
Existing approaches to resource allocation for nowadays stochastic netwo...
We consider learning problems over training sets in which both, the numb...
RES, a regularized stochastic version of the Broyden-Fletcher-Goldfarb-S...