Reinforcement Learning Your Way: Agent Characterization through Policy Regularization

01/21/2022
by   Charl Maree, et al.
0

The increased complexity of state-of-the-art reinforcement learning (RL) algorithms have resulted in an opacity that inhibits explainability and understanding. This has led to the development of several post-hoc explainability methods that aim to extract information from learned policies thus aiding explainability. These methods rely on empirical observations of the policy and thus aim to generalize a characterization of agents' behaviour. In this study, we have instead developed a method to imbue a characteristic behaviour into agents' policies through regularization of their objective functions. Our method guides the agents' behaviour during learning which results in an intrinsic characterization; it connects the learning process with model explanation. We provide a formal argument and empirical evidence for the viability of our method. In future work, we intend to employ it to develop agents that optimize individual financial customers' investment portfolios based on their spending personalities.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/18/2022

Boolean Decision Rules for Reinforcement Learning Policy Summarisation

Explainability of Reinforcement Learning (RL) policies remains a challen...
research
02/18/2022

Can Interpretable Reinforcement Learning Manage Assets Your Way?

Personalisation of products and services is fast becoming the driver of ...
research
12/02/2020

Policy Supervectors: General Characterization of Agents by their Behaviour

By studying the underlying policies of decision-making agents, we can le...
research
08/26/2022

Symbolic Explanation of Affinity-Based Reinforcement Learning Agents with Markov Models

The proliferation of artificial intelligence is increasingly dependent o...
research
09/02/2022

Semi-Centralised Multi-Agent Reinforcement Learning with Policy-Embedded Training

Centralised training (CT) is the basis for many popular multi-agent rein...
research
04/16/2020

MARLeME: A Multi-Agent Reinforcement Learning Model Extraction Library

Multi-Agent Reinforcement Learning (MARL) encompasses a powerful class o...
research
06/04/2023

Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL

Reinforcement learning agents may sometimes develop habits that are effe...

Please sign up or login with your details

Forgot password? Click here to reset