Data-Driven Sample Average Approximation with Covariate Information

07/27/2022
by   Rohit Kannan, et al.
0

We study optimization for data-driven decision-making when we have observations of the uncertain parameters within the optimization model together with concurrent observations of covariates. Given a new covariate observation, the goal is to choose a decision that minimizes the expected cost conditioned on this observation. We investigate three data-driven frameworks that integrate a machine learning prediction model within a stochastic programming sample average approximation (SAA) for approximating the solution to this problem. Two of the SAA frameworks are new and use out-of-sample residuals of leave-one-out prediction models for scenario generation. The frameworks we investigate are flexible and accommodate parametric, nonparametric, and semiparametric regression techniques. We derive conditions on the data generation process, the prediction model, and the stochastic program under which solutions of these data-driven SAAs are consistent and asymptotically optimal, and also derive convergence rates and finite sample guarantees. Computational experiments validate our theoretical results, demonstrate the potential advantages of our data-driven formulations over existing approaches (even when the prediction model is misspecified), and illustrate the benefits of our new data-driven formulations in the limited data regime.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/02/2020

Residuals-based distributionally robust optimization with covariate information

We consider data-driven approaches that integrate a machine learning pre...
research
01/08/2021

Heteroscedasticity-aware residuals-based contextual stochastic optimization

We explore generalizations of some integrated learning and optimization ...
research
07/19/2022

Holistic Robust Data-Driven Decisions

The design of data-driven formulations for machine learning and decision...
research
07/01/2020

Data-Driven Method for Enhanced Corrosion Assessment of Reinforced Concrete Structures

Corrosion is a major problem affecting the durability of reinforced conc...
research
09/14/2021

Learning and Decision-Making with Data: Optimal Formulations and Phase Transitions

We study the problem of designing optimal learning and decision-making f...
research
09/20/2023

Optimize-via-Predict: Realizing out-of-sample optimality in data-driven optimization

We examine a stochastic formulation for data-driven optimization wherein...
research
01/04/2018

Model Class Reliance: Variable Importance Measures for any Machine Learning Model Class, from the "Rashomon" Perspective

There are serious drawbacks to many current variable importance (VI) met...

Please sign up or login with your details

Forgot password? Click here to reset