Inferring agent objectives at different scales of a complex adaptive system

12/04/2017
by Dieter Hendricks, et al.

We introduce a framework to study the effective objectives at different time scales of financial market microstructure. The financial market can be regarded as a complex adaptive system, where purposeful agents collectively and simultaneously create and perceive their environment as they interact with it. It has been suggested that multiple agent classes operate in this system, with a non-trivial hierarchy of top-down and bottom-up causation classes with different effective models governing each level. We conjecture that agent classes may in fact operate at different time scales and thus act differently in response to the same perceived market state. Given scale-specific temporal state trajectories and action sequences estimated from aggregate market behaviour, we use Inverse Reinforcement Learning to compute the effective reward function for the aggregate agent class at each scale, allowing us to assess the relative attractiveness of feature vectors across different scales. Differences in reward functions for feature vectors may indicate different objectives of market participants, which could assist in finding the scale boundary for agent classes. This has implications for learning algorithms operating in this domain.
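As a rough illustration of what "scale-specific temporal state trajectories" could look like in practice, the sketch below resamples a toy price series at two time scales, discretises each into states, and computes normalised discounted state-visitation weights. These weights are the empirical feature expectations that feature-matching Inverse Reinforcement Learning methods compare. This is not the authors' method and it does not recover a reward function. The helper names (`aggregate`, `to_states`, `feature_expectations`), the three-state discretisation, the discount factor, and the price data are all hypothetical choices made for the example.

```python
# Minimal sketch: scale-specific trajectories and their discounted
# feature expectations. All names and data here are illustrative assumptions.

def aggregate(prices, window):
    """Keep the last price of each non-overlapping window of `window` ticks."""
    return [prices[i + window - 1]
            for i in range(0, len(prices) - window + 1, window)]

def to_states(series):
    """Discretise consecutive price changes into 'up', 'down', or 'flat'."""
    states = []
    for prev, cur in zip(series, series[1:]):
        if cur > prev:
            states.append("up")
        elif cur < prev:
            states.append("down")
        else:
            states.append("flat")
    return states

def feature_expectations(states, gamma=0.9):
    """Normalised discounted visitation weight of each state (one-hot features)."""
    mu = {"up": 0.0, "down": 0.0, "flat": 0.0}
    for t, s in enumerate(states):
        mu[s] += gamma ** t
    total = sum(mu.values())
    return {s: (v / total if total else 0.0) for s, v in mu.items()}

# Toy tick-level prices (assumed data, not from the paper).
prices = [100, 101, 100, 101, 102, 103, 102, 103, 104, 105, 104, 105]

# Compare a fine scale (every tick) with a coarse scale (every 4 ticks).
for window in (1, 4):
    trajectory = to_states(aggregate(prices, window))
    print(window, feature_expectations(trajectory))
```

In this toy series the fine scale mixes upward and downward moves, while the coarse scale sees only upward moves, so the same underlying data yields different state statistics at each scale. An IRL method would take such scale-specific trajectories (together with action sequences) as input when estimating a reward function per scale.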

