Randomized Policy Optimization for Optimal Stopping

03/25/2022
by   Xinyi Guan, et al.
0

Optimal stopping is the problem of determining when to stop a stochastic system in order to maximize reward, which is of practical importance in domains such as finance, operations management and healthcare. Existing methods for high-dimensional optimal stopping that are popular in practice produce deterministic linear policies – policies that deterministically stop based on the sign of a weighted sum of basis functions – but are not guaranteed to find the optimal policy within this policy class given a fixed basis function architecture. In this paper, we propose a new methodology for optimal stopping based on randomized linear policies, which choose to stop with a probability that is determined by a weighted sum of basis functions. We motivate these policies by establishing that under mild conditions, given a fixed basis function architecture, optimizing over randomized linear policies is equivalent to optimizing over deterministic linear policies. We formulate the problem of learning randomized linear policies from data as a smooth non-convex sample average approximation (SAA) problem. We theoretically prove the almost sure convergence of our randomized policy SAA problem and establish bounds on the out-of-sample performance of randomized policies obtained from our SAA problem based on Rademacher complexity. We also show that the SAA problem is in general NP-Hard, and consequently develop a practical heuristic for solving our randomized policy problem. Through numerical experiments on a benchmark family of option pricing problem instances, we show that our approach can substantially outperform state-of-the-art methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/18/2018

Interpretable Optimal Stopping

Optimal stopping is the problem of deciding when to stop a stochastic sy...
research
10/30/2021

Intrusion Prevention through Optimal Stopping

We study automated intrusion prevention using reinforcement learning. Fo...
research
05/19/2021

Deep Reinforcement Learning for Optimal Stopping with Application in Financial Engineering

Optimal stopping is the problem of deciding the right time at which to t...
research
04/28/2021

Optimal Stopping via Randomized Neural Networks

This paper presents new machine learning approaches to approximate the s...
research
07/06/2018

Beating the curse of dimensionality in options pricing and optimal stopping

The fundamental problems of pricing high-dimensional path-dependent opti...
research
08/18/2023

The Last Success Problem with a Single Sample

The last success problem is an optimal stopping problem that aims to max...
research
12/25/2019

Asymptotically Optimal Sampling Policy for Quickest Change Detection with Observation-Switching Cost

We consider the problem of quickest change detection (QCD) in a signal w...

Please sign up or login with your details

Forgot password? Click here to reset