Apprenticeship Learning for Model Parameters of Partially Observable Environments

06/27/2012
by Takaki Makino, et al.

We consider apprenticeship learning, i.e., having an agent learn a task by observing an expert demonstrate it, in a partially observable environment whose model is uncertain. This setting is useful in applications where explicit modeling of the environment is difficult, such as dialogue systems. We show that information about the environment model can be extracted by inferring the action selection process behind the demonstration, under the assumption that the expert chooses optimal actions based on knowledge of the true model of the target environment. Compared to methods that learn only from the environment's reactions, the proposed algorithms can achieve more accurate estimates of POMDP parameters and better policies from a short demonstration.
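
The core idea, that an expert's action choices carry evidence about the environment model, can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes a small finite set of candidate models, a Boltzmann-rational (softmax) expert in place of a strictly optimal one, and precomputed action values for each candidate. All names (`posterior_over_models`, `q_tables`, `beta`, the two sensor models, and the "uncertain"/"confident" situations standing in for belief states) are hypothetical.

```python
import math

def action_likelihood(q_values, action, beta=5.0):
    """Probability that a softmax-rational expert picks `action` given Q-values.

    Assumption: the expert is near-optimal with rationality `beta`;
    a larger beta approaches strictly optimal action selection.
    """
    top = max(q_values)
    weights = [math.exp(beta * (q - top)) for q in q_values]
    return weights[action] / sum(weights)

def posterior_over_models(prior, q_tables, demonstration, beta=5.0):
    """Bayesian update over candidate models from observed expert actions.

    prior:         dict model name -> prior probability (must be > 0)
    q_tables:      dict model name -> dict situation -> list of Q-values
    demonstration: list of (situation, action index) pairs
    """
    log_post = {name: math.log(p) for name, p in prior.items()}
    for situation, action in demonstration:
        for name in log_post:
            q_values = q_tables[name][situation]
            log_post[name] += math.log(action_likelihood(q_values, action, beta))
    # Normalize in log space for numerical stability.
    top = max(log_post.values())
    unnorm = {name: math.exp(lp - top) for name, lp in log_post.items()}
    total = sum(unnorm.values())
    return {name: u / total for name, u in unnorm.items()}

# Hypothetical example: actions are 0 = gather information, 1 = commit.
# If the sensor is reliable, gathering information pays off when uncertain;
# if it is noisy, committing immediately is better.
q_tables = {
    "reliable_sensor": {"uncertain": [1.0, 0.0], "confident": [0.0, 1.0]},
    "noisy_sensor":    {"uncertain": [0.0, 1.0], "confident": [0.0, 1.0]},
}
prior = {"reliable_sensor": 0.5, "noisy_sensor": 0.5}
demonstration = [("uncertain", 0), ("uncertain", 0), ("confident", 1)]

posterior = posterior_over_models(prior, q_tables, demonstration)
print(posterior)
```

In this toy setup the expert repeatedly gathers information when uncertain, which is only sensible if the sensor is reliable, so the posterior shifts strongly toward that model even though no observation outcomes were used. The "confident" step does not discriminate between the candidates because both assign it the same action values.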


