Optimize-via-Predict: Realizing out-of-sample optimality in data-driven optimization

09/20/2023
by   Gar Goei Loke, et al.
0

We examine a stochastic formulation for data-driven optimization wherein the decision-maker is not privy to the true distribution, but has knowledge that it lies in some hypothesis set and possesses a historical data set, from which information about it can be gleaned. We define a prescriptive solution as a decision rule mapping such a data set to decisions. As there does not exist prescriptive solutions that are generalizable over the entire hypothesis set, we define out-of-sample optimality as a local average over a neighbourhood of hypotheses, and averaged over the sampling distribution. We prove sufficient conditions for local out-of-sample optimality, which reduces to functions of the sufficient statistic of the hypothesis family. We present an optimization problem that would solve for such an out-of-sample optimal solution, and does so efficiently by a combination of sampling and bisection search algorithms. Finally, we illustrate our model on the newsvendor model, and find strong performance when compared against alternatives in the literature. There are potential implications of our research on end-to-end learning and Bayesian optimization.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/30/2019

Data-Driven Model Set Design for Model Averaged Particle Filter

This paper is concerned with sequential state filtering in the presence ...
research
09/14/2021

Learning and Decision-Making with Data: Optimal Formulations and Phase Transitions

We study the problem of designing optimal learning and decision-making f...
research
06/07/2013

Loss-Proportional Subsampling for Subsequent ERM

We propose a sampling scheme suitable for reducing a data set prior to s...
research
07/27/2022

Data-Driven Sample Average Approximation with Covariate Information

We study optimization for data-driven decision-making when we have obser...
research
08/12/2012

How to sample if you must: on optimal functional sampling

We examine a fundamental problem that models various active sampling set...
research
05/27/2021

On the Impossibility of Statistically Improving Empirical Optimization: A Second-Order Stochastic Dominance Perspective

When the underlying probability distribution in a stochastic optimizatio...
research
07/19/2022

Holistic Robust Data-Driven Decisions

The design of data-driven formulations for machine learning and decision...

Please sign up or login with your details

Forgot password? Click here to reset