Multiscale Inverse Reinforcement Learning using Diffusion Wavelets

11/24/2016
by Jung-Su Ha, et al.

This work presents a multiscale framework for solving inverse reinforcement learning (IRL) problems for continuous-time, continuous-state stochastic systems. We use a diffusion wavelet representation of the associated Markov chain to abstract the state space. This representation not only handles large and geometrically complex decision spaces effectively but also yields more interpretable representations of both the demonstrated state trajectories and the resulting IRL policy. In the proposed framework, the problem is divided into global and local IRL: the global approximation of the optimal value functions is obtained using coarse features, and the local details are quantified using fine local features. An illustrative numerical example of robot path control in a complex environment is presented to verify the proposed method.
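To make the multiscale idea concrete, the following is a minimal sketch and not the authors' implementation. It illustrates the property that diffusion wavelet constructions rely on: dyadic powers T, T^2, T^4, ... of a Markov transition matrix have decreasing numerical rank, so coarser scales can be described with fewer features. The toy chain (a lazy random walk on a 1-D path), the function names `random_walk_matrix`, `numerical_rank`, and `dyadic_ranks`, and the tolerance are all assumptions made for illustration. The full construction also builds orthonormal bases at each level, which is omitted here.

```python
import numpy as np


def random_walk_matrix(n):
    """Lazy random walk on a path of n states (hypothetical toy Markov chain).

    Each state stays put with probability 0.5 and moves to each neighbor
    with probability 0.25; boundary states keep the mass that would leave.
    """
    T = np.zeros((n, n))
    for i in range(n):
        T[i, i] = 0.5
        if i > 0:
            T[i, i - 1] = 0.25
        else:
            T[i, i] += 0.25
        if i < n - 1:
            T[i, i + 1] = 0.25
        else:
            T[i, i] += 0.25
    return T


def numerical_rank(M, tol=1e-6):
    """Count singular values above tol relative to the largest one."""
    s = np.linalg.svd(M, compute_uv=False)
    return int(np.sum(s > tol * s[0]))


def dyadic_ranks(T, levels, tol=1e-6):
    """Numerical rank of T^(2^j) for j = 0, ..., levels - 1."""
    ranks = []
    P = T.copy()
    for _ in range(levels):
        ranks.append(numerical_rank(P, tol))
        P = P @ P  # square to move to the next dyadic scale
    return ranks


if __name__ == "__main__":
    T = random_walk_matrix(64)
    print(dyadic_ranks(T, levels=6))
```

In this sketch the rank at each level indicates how many coarse features would be needed to represent the chain at that time scale; a global IRL stage would work with the small coarse set, and a local stage would refine with finer-scale features.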

research
08/22/2019

On Convergence Rate of Adaptive Multiscale Value Function Approximation For Reinforcement Learning

In this paper, we propose a generic framework for devising an adaptive a...
research
08/09/2016

Neuroevolution-Based Inverse Reinforcement Learning

The problem of Learning from Demonstration is targeted at learning to pe...
research
11/10/2017

Model Checking Markov Population Models by Stochastic Approximations

Many complex systems can be described by population models, in which a p...
research
06/30/2020

Preconditioning Markov Chain Monte Carlo Method for Geomechanical Subsidence using multiscale method and machine learning technique

In this paper, we consider the numerical solution of the poroelasticity ...
research
11/30/2022

Reinforcement Learning for Signal Temporal Logic using Funnel-Based Approach

Signal Temporal Logic (STL) is a powerful framework for describing the c...
research
06/15/2021

On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control

Reinforcement learning is a framework for interactive decision-making wi...
research
01/09/2021

Identifying Decision Points for Safe and Interpretable Reinforcement Learning in Hypotension Treatment

Many batch RL health applications first discretize time into fixed inter...
