Adversarial recovery of agent rewards from latent spaces of the limit order book

12/09/2019
by   Jacobo Roa-Vicens, et al.
6

Inverse reinforcement learning has proved its ability to explain state-action trajectories of expert agents by recovering their underlying reward functions in increasingly challenging environments. Recent advances in adversarial learning have allowed extending inverse RL to applications with non-stationary environment dynamics unknown to the agents, arbitrary structures of reward functions and improved handling of the ambiguities inherent to the ill-posed nature of inverse RL. This is particularly relevant in real time applications on stochastic environments involving risk, like volatile financial markets. Moreover, recent work on simulation of complex environments enable learning algorithms to engage with real market data through simulations of its latent space representations, avoiding a costly exploration of the original environment. In this paper, we explore whether adversarial inverse RL algorithms can be adapted and trained within such latent space simulations from real market data, while maintaining their ability to recover agent rewards robust to variations in the underlying dynamics, and transfer them to new regimes of the original environment.

READ FULL TEXT
research
02/20/2020

oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions

Explicit engineering of reward functions for given environments has been...
research
06/11/2019

Towards Inverse Reinforcement Learning for Limit Order Book Dynamics

Multi-agent learning is a promising method to simulate aggregate competi...
research
02/24/2020

Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic

Multi-agent adversarial inverse reinforcement learning (MA-AIRL) is a re...
research
02/06/2022

Learning Synthetic Environments and Reward Networks for Reinforcement Learning

We introduce Synthetic Environments (SEs) and Reward Networks (RNs), rep...
research
11/17/2020

Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization

The problem of inverse reinforcement learning (IRL) is relevant to a var...
research
12/04/2017

Inferring agent objectives at different scales of a complex adaptive system

We introduce a framework to study the effective objectives at different ...
research
06/13/2022

On the Design of Decentralised Data Markets

We present an architecture to implement a decentralised data market, whe...

Please sign up or login with your details

Forgot password? Click here to reset