End-to-End Policy Gradient Method for POMDPs and Explainable Agents

04/19/2023
by   Soichiro Nishimori, et al.
0

Real-world decision-making problems are often partially observable, and many can be formulated as a Partially Observable Markov Decision Process (POMDP). When we apply reinforcement learning (RL) algorithms to the POMDP, reasonable estimation of the hidden states can help solve the problems. Furthermore, explainable decision-making is preferable, considering their application to real-world tasks such as autonomous driving cars. We proposed an RL algorithm that estimates the hidden states by end-to-end training, and visualize the estimation as a state-transition graph. Experimental results demonstrated that the proposed algorithm can solve simple POMDP problems and that the visualization makes the agent's behavior interpretable to humans.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/13/2021

Reinforcement Learning Based Safe Decision Making for Highway Autonomous Driving

In this paper, we develop a safe decision-making method for self-driving...
research
06/15/2023

Semantic HELM: An Interpretable Memory for Reinforcement Learning

Reinforcement learning agents deployed in the real world often have to c...
research
04/22/2021

Reinforcement Learning using Guided Observability

Due to recent breakthroughs, reinforcement learning (RL) has demonstrate...
research
05/08/2023

Goal-oriented inference of environment from redundant observations

The agent learns to organize decision behavior to achieve a behavioral g...
research
03/01/2020

Learning to Simulate Human Movement

Modeling how human moves on the space is useful for policy-making in tra...
research
02/15/2021

Uncovering Interpretable Internal States of Merging Tasks at Highway On-Ramps for Autonomous Driving Decision-Making

Humans make daily-routine decisions based on their internal states in in...
research
03/06/2023

Scenario-Agnostic Zero-Trust Defense with Explainable Threshold Policy: A Meta-Learning Approach

The increasing connectivity and intricate remote access environment have...

Please sign up or login with your details

Forgot password? Click here to reset