Interpretable Optimal Stopping

12/18/2018
by   Dragos Florin Ciocan, et al.
0

Optimal stopping is the problem of deciding when to stop a stochastic system to obtain the greatest reward, arising in numerous application areas such as finance, healthcare and marketing. State-of-the-art methods for high-dimensional optimal stopping involve approximating the value function or the continuation value, and then using that approximation within a greedy policy. Although such policies can perform very well, they are generally not guaranteed to be interpretable; that is, a decision maker may not be able to easily see the link between the current system state and the policy's action. In this paper, we propose a new approach to optimal stopping, wherein the policy is represented as a binary tree, in the spirit of naturally interpretable tree models commonly used in machine learning. We formulate the problem of learning such policies from observed trajectories of the stochastic system as a sample average approximation (SAA) problem. We prove that the SAA problem converges under mild conditions as the sample size increases, but that computationally even immediate simplifications of the SAA problem are theoretically intractable. We thus propose a tractable heuristic for approximately solving the SAA problem, by greedily constructing the tree from the top down. We demonstrate the value of our approach by applying it to the canonical problem of option pricing, using both synthetic instances and instances calibrated with real S&P 500 data. Our method obtains policies that (1) outperform state-of-the-art non-interpretable methods, based on simulation-regression and martingale duality, and (2) possess a remarkably simple and intuitive structure.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/25/2022

Randomized Policy Optimization for Optimal Stopping

Optimal stopping is the problem of determining when to stop a stochastic...
research
05/19/2021

Deep Reinforcement Learning for Optimal Stopping with Application in Financial Engineering

Optimal stopping is the problem of deciding the right time at which to t...
research
06/14/2021

Learning Intrusion Prevention Policies through Optimal Stopping

We study automated intrusion prevention using reinforcement learning. In...
research
10/30/2021

Intrusion Prevention through Optimal Stopping

We study automated intrusion prevention using reinforcement learning. Fo...
research
03/04/2022

Interpretable Off-Policy Learning via Hyperbox Search

Personalized treatment decisions have become an integral part of modern ...
research
12/25/2019

Asymptotically Optimal Sampling Policy for Quickest Change Detection with Observation-Switching Cost

We consider the problem of quickest change detection (QCD) in a signal w...
research
07/20/2022

Constrained Prescriptive Trees via Column Generation

With the abundance of available data, many enterprises seek to implement...

Please sign up or login with your details

Forgot password? Click here to reset