Learn Continuously, Act Discretely: Hybrid Action-Space Reinforcement Learning For Optimal Execution

07/22/2022
by   Feiyang Pan, et al.
6

Optimal execution is a sequential decision-making problem for cost-saving in algorithmic trading. Studies have found that reinforcement learning (RL) can help decide the order-splitting sizes. However, a problem remains unsolved: how to place limit orders at appropriate limit prices? The key challenge lies in the "continuous-discrete duality" of the action space. On the one hand, the continuous action space using percentage changes in prices is preferred for generalization. On the other hand, the trader eventually needs to choose limit prices discretely due to the existence of the tick size, which requires specialization for every single stock with different characteristics (e.g., the liquidity and the price range). So we need continuous control for generalization and discrete control for specialization. To this end, we propose a hybrid RL method to combine the advantages of both of them. We first use a continuous control agent to scope an action subset, then deploy a fine-grained agent to choose a specific limit price. Extensive experiments show that our method has higher sample efficiency and better training stability than existing RL algorithms and significantly outperforms previous learning-based methods for order execution.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/25/2023

Market Making with Deep Reinforcement Learning from Limit Order Books

Market making (MM) is an important research topic in quantitative financ...
research
09/12/2021

HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation

Discrete-continuous hybrid action space is a natural setting in many pra...
research
11/28/2022

Continuous Episodic Control

Non-parametric episodic memory can be used to quickly latch onto high-re...
research
06/21/2022

Imitate then Transcend: Multi-Agent Optimal Execution with Dual-Window Denoise PPO

A novel framework for solving the optimal execution and placement proble...
research
10/10/2018

Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space

Most existing deep reinforcement learning (DRL) frameworks consider eith...
research
11/22/2020

Reinforcement learning with distance-based incentive/penalty (DIP) updates for highly constrained industrial control systems

Typical reinforcement learning (RL) methods show limited applicability f...
research
11/23/2022

Reinforcement learning for traffic signal control in hybrid action space

The prevailing reinforcement-learning-based traffic signal control metho...

Please sign up or login with your details

Forgot password? Click here to reset