Blackwell's Approachability with Time-Dependent Outcome Functions and Dot Products. Application to the Big Match

03/09/2023
by   Joon Kwon, et al.
0

Blackwell's approachability is a very general sequential decision framework where a Decision Maker obtains vector-valued outcomes, and aims at the convergence of the average outcome to a given "target" set. Blackwell gave a sufficient condition for the decision maker having a strategy guaranteeing such a convergence against an adversarial environment, as well as what we now call the Blackwell's algorithm, which then ensures convergence. Blackwell's approachability has since been applied to numerous problems, in online learning and game theory, in particular. We extend this framework by allowing the outcome function and the dot product to be time-dependent. We establish a general guarantee for the natural extension to this framework of Blackwell's algorithm. In the case where the target set is an orthant, we present a family of time-dependent dot products which yields different convergence speeds for each coordinate of the average outcome. We apply this framework to the Big Match (one of the most important toy examples of stochastic games) where an ϵ-uniformly optimal strategy for Player I is given by Blackwell's algorithm in a well-chosen auxiliary approachability problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/28/2016

Approachability of convex sets in generalized quitting games

We consider Blackwell approachability, a very powerful and geometric too...
research
03/20/2013

Time-Dependent Utility and Action Under Uncertainty

We discuss representing and reasoning with knowledge about the time-depe...
research
02/10/2014

Approachability in unknown games: Online learning meets multi-objective optimization

In the standard setting of approachability there are two players and a t...
research
09/08/2020

Refined approachability algorithms and application to regret minimization with global costs

Blackwell's approachability is a framework where two players, the Decisi...
research
02/06/2023

Random Forests for time-fixed and time-dependent predictors: The DynForest R package

The R package DynForest implements random forests for predicting a categ...
research
10/06/2021

Policy iteration method for time-dependent Mean Field Games systems with non-separable Hamiltonians

We introduce two algorithms based on a policy iteration method to numeri...
research
02/06/2013

Time-Critical Reasoning: Representations and Application

We review the problem of time-critical action and discuss a reformulatio...

Please sign up or login with your details

Forgot password? Click here to reset