Frank Wolfe Meets Metric Entropy

05/17/2022
by   Suhas Vijaykumar, et al.
0

The Frank-Wolfe algorithm has seen a resurgence in popularity due to its ability to efficiently solve constrained optimization problems in machine learning and high-dimensional statistics. As such, there is much interest in establishing when the algorithm may possess a "linear" O(log(1/ϵ)) dimension-free iteration complexity comparable to projected gradient descent. In this paper, we provide a general technique for establishing domain specific and easy-to-estimate lower bounds for Frank-Wolfe and its variants using the metric entropy of the domain. Most notably, we show that a dimension-free linear upper bound must fail not only in the worst case, but in the average case: for a Gaussian or spherical random polytope in ℝ^d with poly(d) vertices, Frank-Wolfe requires up to Ω̃(d) iterations to achieve a O(1/d) error bound, with high probability. We also establish this phenomenon for the nuclear norm ball. The link with metric entropy also has interesting positive implications for conditional gradient algorithms in statistics, such as gradient boosting and matching pursuit. In particular, we show that it is possible to extract fast-decaying upper bounds on the excess risk directly from an analysis of the underlying optimization procedure.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/30/2023

The Iteration Number of the Weisfeiler-Leman Algorithm

We prove new upper and lower bounds on the number of iterations the k-di...
research
02/23/2023

Logistic Regression and Classification with non-Euclidean Covariates

We introduce a logistic regression model for data pairs consisting of a ...
research
04/04/2018

Stability and Convergence Trade-off of Iterative Optimization Algorithms

The overall performance or expected excess risk of an iterative machine ...
research
12/24/2021

Asymptotic Bounds on the Combinatorial Diameter of Random Polytopes

The combinatorial diameter diam(P) of a polytope P is the maximum shorte...
research
09/29/2021

Pursuit-evasion games on latin square graphs

We investigate various pursuit-evasion parameters on latin square graphs...
research
09/07/2020

Efficient Projection Algorithms onto the Weighted l1 Ball

Projected gradient descent has been proved efficient in many optimizatio...
research
06/15/2023

Average Case Error Estimates of the Strong Lucas Test

Reliable probabilistic primality tests are fundamental in public-key cry...

Please sign up or login with your details

Forgot password? Click here to reset