Coordinate Methods for Matrix Games

09/17/2020
by   Yair Carmon, et al.
0

We develop primal-dual coordinate methods for solving bilinear saddle-point problems of the form min_x ∈𝒳max_y∈𝒴 y^⊤ A x which contain linear programming, classification, and regression as special cases. Our methods push existing fully stochastic sublinear methods and variance-reduced methods towards their limits in terms of per-iteration complexity and sample complexity. We obtain nearly-constant per-iteration complexity by designing efficient data structures leveraging Taylor approximations to the exponential and a binomial heap. We improve sample complexity via low-variance gradient estimators using dynamic sampling distributions that depend on both the iterates and the magnitude of the matrix entries. Our runtime bounds improve upon those of existing primal-dual methods by a factor depending on sparsity measures of the m by n matrix A. For example, when rows and columns have constant ℓ_1/ℓ_2 norm ratios, we offer improvements by a factor of m+n in the fully stochastic setting and √(m+n) in the variance-reduced setting. We apply our methods to computational geometry problems, i.e. minimum enclosing ball, maximum inscribed ball, and linear regression, and obtain improved complexity bounds. For linear regression with an elementwise nonnegative matrix, our guarantees improve on exact gradient methods by a factor of √(nnz(A)/(m+n)).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/03/2019

Variance Reduction for Matrix Games

We present a randomized primal-dual algorithm that solves the problem _x...
research
11/10/2021

Linear Convergence of Stochastic Primal Dual Methods for Linear Programming Using Variance Reduction and Restarts

There is a recent interest on first-order methods for linear programming...
research
01/19/2022

On the Complexity of a Practical Primal-Dual Coordinate Method

We prove complexity bounds for the primal-dual algorithm with random ext...
research
11/02/2021

Coordinate Linear Variance Reduction for Generalized Linear Programming

We study a class of generalized linear programs (GLP) in a large-scale s...
research
07/14/2023

Variance-reduced accelerated methods for decentralized stochastic double-regularized nonconvex strongly-concave minimax problems

In this paper, we consider the decentralized, stochastic nonconvex stron...
research
10/20/2022

Local SGD in Overparameterized Linear Regression

We consider distributed learning using constant stepsize SGD (DSGD) over...
research
06/13/2019

Variance Estimation For Online Regression via Spectrum Thresholding

We consider the online linear regression problem, where the predictor ve...

Please sign up or login with your details

Forgot password? Click here to reset