Efficient Learning for Selecting Top-m Context-Dependent Designs

05/06/2023
by   Gongbo Zhang, et al.
0

We consider a simulation optimization problem for a context-dependent decision-making, which aims to determine the top-m designs for all contexts. Under a Bayesian framework, we formulate the optimal dynamic sampling decision as a stochastic dynamic programming problem, and develop a sequential sampling policy to efficiently learn the performance of each design under each context. The asymptotically optimal sampling ratios are derived to attain the optimal large deviations rate of the worst-case of probability of false selection. The proposed sampling policy is proved to be consistent and its asymptotic sampling ratios are asymptotically optimal. Numerical experiments demonstrate that the proposed method improves the efficiency for selection of top-m context-dependent designs.

READ FULL TEXT
research
12/10/2020

Efficient Learning for Clustering and Optimizing Context-Dependent Designs

We consider a simulation optimization problem for a context-dependent de...
research
11/30/2021

Asymptotically Optimal Sampling Policy for Selecting Top-m Alternatives

We consider selecting the top-m alternatives from a finite number of alt...
research
06/30/2023

Top-Two Thompson Sampling for Contextual Top-mc Selection Problems

We aim to efficiently allocate a fixed simulation budget to identify the...
research
03/28/2023

A reinforced learning approach to optimal design under model uncertainty

Optimal designs are usually model-dependent and likely to be sub-optimal...
research
09/13/2017

Optimal Learning for Sequential Decision Making for Expensive Cost Functions with Stochastic Binary Feedbacks

We consider the problem of sequentially making decisions that are reward...
research
12/10/2020

Context-dependent Ranking and Selection under a Bayesian Framework

We consider a context-dependent ranking and selection problem. The best ...
research
02/13/2021

Diffusion Approximations for a Class of Sequential Testing Problems

We consider a decision maker who must choose an action in order to maxim...

Please sign up or login with your details

Forgot password? Click here to reset