Solving optimization problems with Blackwell approachability

02/24/2022

∙

We introduce the Conic Blackwell Algorithm^+ (CBA^+) regret minimizer, a new parameter- and scale-free regret minimizer for general convex sets. CBA^+ is based on Blackwell approachability and attains O(√(T)) regret. We show how to efficiently instantiate CBA^+ for many decision sets of interest, including the simplex, ℓ_p norm balls, and ellipsoidal confidence regions in the simplex. Based on CBA^+, we introduce SP-CBA^+, a new parameter-free algorithm for solving convex-concave saddle-point problems, which achieves a O(1/√(T)) ergodic rate of convergence. In our simulations, we demonstrate the wide applicability of SP-CBA^+ on several standard saddle-point problems, including matrix games, extensive-form games, distributionally robust logistic regression, and Markov decision processes. In each setting, SP-CBA^+ achieves state-of-the-art numerical performance, and outperforms classical methods, without the need for any choice of step sizes or other algorithmic parameters.

READ FULL TEXT

Solving optimization problems with Blackwell approachability

Sign in with Google

Consider DeepAI Pro