Solving optimal stopping problems with Deep Q-Learning

01/24/2021
by   John Ery, et al.
0

We propose a reinforcement learning (RL) approach to model optimal exercise strategies for option-type products. We pursue the RL avenue in order to learn the optimal action-value function of the underlying stopping problem. In addition to retrieving the optimal Q-function at any time step, one can also price the contract at inception. We first discuss the standard setting with one exercise right, and later extend this framework to the case of multiple stopping opportunities in the presence of constraints. We propose to approximate the Q-function with a deep neural network, which does not require the specification of basis functions as in the least-squares Monte Carlo framework and is scalable to higher dimensions. We derive a lower bound on the option price obtained from the trained neural network and an upper bound from the dual formulation of the stopping problem, which can also be expressed in terms of the Q-function. Our methodology is illustrated with examples covering the pricing of swing options.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/19/2021

Deep Reinforcement Learning for Optimal Stopping with Application in Financial Engineering

Optimal stopping is the problem of deciding the right time at which to t...
research
10/19/2022

Deep neural network expressivity for optimal stopping problems

This article studies deep neural network expression rates for optimal st...
research
08/05/2019

Solving high-dimensional optimal stopping problems using deep learning

Nowadays many financial derivatives which are traded on stock and future...
research
06/01/2021

Optimal Stopping with Behaviorally Biased Agents: The Role of Loss Aversion and Changing Reference Points

People are often reluctant to sell a house, or shares of stock, below th...
research
02/24/2023

Simultaneous upper and lower bounds of American option prices with hedging via neural networks

In this paper, we introduce two methods to solve the American-style opti...
research
08/07/2018

Optimal stopping via deeply boosted backward regression

In this note we propose a new approach towards solving numerically optim...
research
05/11/2022

RLOP: RL Methods in Option Pricing from a Mathematical Perspective

Abstract In this work, we build two environments, namely the modified QL...

Please sign up or login with your details

Forgot password? Click here to reset