Online Learning to Transport via the Minimal Selection Principle

02/09/2022
by Wenxuan Guo, et al.

Motivated by robust dynamic resource allocation in operations research, we study the Online Learning to Transport (OLT) problem where the decision variable is a probability measure, an infinite-dimensional object. We draw connections between online learning, optimal transport, and partial differential equations through an insight called the minimal selection principle, originally studied in the Wasserstein gradient flow setting by Ambrosio et al. (2005). This allows us to extend the standard online learning framework to the infinite-dimensional setting seamlessly. Based on our framework, we derive a novel method called the minimal selection or exploration (MSoE) algorithm to solve OLT problems using mean-field approximation and discretization techniques. In the displacement convex setting, the main theoretical message underpinning our approach is that minimizing transport cost over time (via the minimal selection principle) ensures optimal cumulative regret upper bounds. On the algorithmic side, our MSoE algorithm applies beyond the displacement convex setting, making the mathematical theory of optimal transport practically relevant to non-convex settings common in dynamic resource allocation.

research
09/19/2019

Unconditional convergence for discretizations of dynamical optimal transport

The dynamical formulation of optimal transport, also known as Benamou-Br...
research
05/01/2019

On Scalable and Efficient Computation of Large Scale Optimal Transport

Optimal Transport (OT) naturally arises in many machine learning applica...
research
05/04/2020

No-Regret Stateful Posted Pricing

In this paper, a rather general online problem called dynamic resource a...
research
02/28/2022

Online Learning with Knapsacks: the Best of Both Worlds

We study online learning problems in which a decision maker wants to max...
research
02/18/2021

A Mathematical Principle of Deep Learning: Learn the Geodesic Curve in the Wasserstein Space

Recent studies revealed the mathematical connection of deep neural netwo...
research
10/21/2019

Robust Online Learning for Resource Allocation – Beyond Euclidean Projection and Dynamic Fit

Online-learning literature has focused on designing algorithms that ensu...
research
10/07/2019

A mathematical theory of cooperative communication

Cooperative communication plays a central role in theories of human cogn...
