InfoRL: Interpretable Reinforcement Learning using Information Maximization

05/24/2019
by   Aadil Hayat, et al.
0

Recent advances in reinforcement learning have proved that given an environment we can learn to perform a task in that environment if we have access to some form of a reward function (dense, sparse or derived from IRL). But most of the algorithms focus on learning a single best policy to perform a given set of tasks. In this paper, we focus on an algorithm that learns to not just perform a task but different ways to perform the same task. As we know when the environment is complex enough there always exists multiple ways to perform a task. We show that using the concept of information maximization it is possible to learn latent codes for discovering multiple ways to perform any given task in an environment.

READ FULL TEXT

page 5

page 6

page 7

research
09/22/2022

Identifiability and generalizability from multiple experts in Inverse Reinforcement Learning

While Reinforcement Learning (RL) aims to train an agent from a reward f...
research
07/26/2019

Environment Probing Interaction Policies

A key challenge in reinforcement learning (RL) is environment generaliza...
research
01/25/2016

Towards Resolving Unidentifiability in Inverse Reinforcement Learning

We consider a setting for Inverse Reinforcement Learning (IRL) where the...
research
09/21/2018

Interpretable Multi-Objective Reinforcement Learning through Policy Orchestration

Autonomous cyber-physical agents and systems play an increasingly large ...
research
11/22/2019

Fleet Control using Coregionalized Gaussian Process Policy Iteration

In many settings, as for example wind farms, multiple machines are insta...
research
07/12/2022

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization

Recent algorithms designed for reinforcement learning tasks focus on fin...
research
11/27/2020

Efficient Information Diffusion in Time-Varying Graphs through Deep Reinforcement Learning

Network seeding for efficient information diffusion over time-varying gr...

Please sign up or login with your details

Forgot password? Click here to reset