Bayesian Nonparametric Feature and Policy Learning for Decision-Making

02/26/2017
by   Jürgen Hahn, et al.

Learning from demonstrations, in which an agent learns how to make decisions by observing an experienced teacher, has attracted increasing interest in recent years. While many approaches have been proposed to solve this problem, little work focuses on reasoning about the observed behavior. We assume that, in many practical problems, an agent bases its decisions on latent features that indicate a certain action. Therefore, we propose a generative model for the states and actions. Inference reveals the number of features, the features themselves, and the policies, allowing us to learn and analyze the underlying structure of the observed behavior. Further, our approach enables prediction of actions for new states. Simulations are used to assess the performance of the algorithm based on this model. Moreover, we investigate the problem of learning a driver's behavior, demonstrating the performance of the proposed model in a real-world scenario.

research
01/15/2014

Interactive Policy Learning through Confidence-Based Autonomy

We present Confidence-Based Autonomy (CBA), an interactive algorithm for...
research
04/09/2021

Inverse Reinforcement Learning a Control Lyapunov Approach

Inferring the intent of an intelligent agent from demonstrations and sub...
research
04/19/2017

Simultaneous Policy Learning and Latent State Inference for Imitating Driver Behavior

In this work, we propose a method for learning driver models that accoun...
research
02/27/2018

Multi-agent Time-based Decision-making for the Search and Action Problem

Many robotic applications, such as search-and-rescue, require multiple a...
research
03/08/2022

Policy Regularization for Legible Behavior

In Reinforcement Learning interpretability generally means to provide in...
research
05/17/2022

Moral reinforcement learning using actual causation

Reinforcement learning systems will to a greater and greater extent make...
research
06/05/2019

Lifelong Learning with a Changing Action Set

In many real-world sequential decision making problems, the number of av...
