Contextual Inverse Optimization: Offline and Online Learning

06/26/2021
by Omar Besbes, et al.

We study the problems of offline and online contextual optimization with feedback information, where instead of observing the loss, we observe, after the fact, the optimal action that an oracle with full knowledge of the objective function would have taken. We aim to minimize regret, defined as the difference between our losses and those incurred by an all-knowing oracle. In the offline setting, the decision-maker has information available from past periods and needs to make one decision, while in the online setting, the decision-maker optimizes decisions dynamically over time based on a new set of feasible actions and contextual functions in each period. For the offline setting, we characterize the optimal minimax policy, establishing the performance that can be achieved as a function of the underlying geometry of the information induced by the data. In the online setting, we leverage this geometric characterization to optimize the cumulative regret. We develop an algorithm that yields the first regret bound for this problem that is logarithmic in the time horizon.
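
To make the regret notion above concrete, here is a minimal sketch for a single period. It assumes a finite set of feasible actions and a loss that is linear in an unknown cost vector; the abstract states neither, so both are illustrative assumptions. The names `loss`, `best_action`, and `regret` are hypothetical and do not come from the paper.

```python
# Illustrative sketch only: linear losses and a finite action set are
# assumptions made for this example, not details taken from the abstract.

def loss(features, cost):
    """Loss of an action, taken here as the dot product of its features with the cost vector."""
    return sum(f * c for f, c in zip(features, cost))

def best_action(actions, cost):
    """Index of the action with the smallest loss under the given cost vector."""
    return min(range(len(actions)), key=lambda i: loss(actions[i], cost))

def regret(actions, chosen, cost):
    """Loss of the chosen action minus the loss of the oracle's action."""
    oracle = best_action(actions, cost)
    return loss(actions[chosen], cost) - loss(actions[oracle], cost)

# Three feasible actions, each described by two features.
actions = [(1.0, 0.0), (0.0, 1.0), (0.5, 0.5)]
true_cost = (0.2, 0.8)     # known only to the oracle
guessed_cost = (0.9, 0.1)  # the decision-maker's current estimate

chosen = best_action(actions, guessed_cost)  # what the decision-maker plays
oracle = best_action(actions, true_cost)     # the feedback revealed after the fact
period_regret = regret(actions, chosen, true_cost)
```

In this sketch the decision-maker never sees `period_regret` directly; only the index `oracle` is revealed, which is the feedback structure the abstract describes. In the online setting, such per-period regrets would be summed over the horizon.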

research
04/20/2018

Online Improper Learning with an Approximation Oracle

We revisit the question of reducing online learning to approximate optim...
research
02/09/2022

Smoothed Online Learning is as Easy as Statistical Learning

Much of modern learning theory has been split between two regimes: the c...
research
02/12/2022

Coupling Online-Offline Learning for Multi-distributional Data Streams

The distributions of real-life data streams are usually nonstationary, w...
research
10/30/2018

An Online-Learning Approach to Inverse Optimization

In this paper, we demonstrate how to learn the objective function of a d...
research
04/08/2022

Decision-Dependent Risk Minimization in Geometrically Decaying Dynamic Environments

This paper studies the problem of expected loss minimization given a dat...
research
03/31/2022

Online Learning for Traffic Routing under Unknown Preferences

In transportation networks, users typically choose routes in a decentral...
research
05/03/2018

Nonparametric Learning and Optimization with Covariates

Modern decision analytics frequently involves the optimization of an obj...
