Model-based Bayesian Reinforcement Learning for Dialogue Management

04/05/2013
by   Pierre Lison, et al.
0

Reinforcement learning methods are increasingly used to optimise dialogue policies from experience. Most current techniques are model-free: they directly estimate the utility of various actions, without explicit model of the interaction dynamics. In this paper, we investigate an alternative strategy grounded in model-based Bayesian reinforcement learning. Bayesian inference is used to maintain a posterior distribution over the model parameters, reflecting the model uncertainty. This parameter distribution is gradually refined as more data is collected and simultaneously used to plan the agent's actions. Within this learning framework, we carried out experiments with two alternative formalisations of the transition model, one encoded with standard multinomial distributions, and one structured with probabilistic rules. We demonstrate the potential of our approach with empirical results on a user simulator constructed from Wizard-of-Oz data in a human-robot interaction scenario. The results illustrate in particular the benefits of capturing prior domain knowledge with high-level rules.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/13/2012

Model-Based Bayesian Reinforcement Learning in Large Structured Domains

Model-based Bayesian reinforcement learning has generated significant in...
research
08/12/2023

Value-Distributional Model-Based Reinforcement Learning

Quantifying uncertainty about a policy's long-term performance is import...
research
12/19/2014

Grounding Hierarchical Reinforcement Learning Models for Knowledge Transfer

Methods of deep machine learning enable to to reuse low-level representa...
research
07/22/2023

On-Robot Bayesian Reinforcement Learning for POMDPs

Robot learning is often difficult due to the expense of gathering data. ...
research
07/10/2019

Interpretable Dynamics Models for Data-Efficient Reinforcement Learning

In this paper, we present a Bayesian view on model-based reinforcement l...
research
02/20/2021

Towards Automatic Evaluation of Dialog Systems: A Model-Free Off-Policy Evaluation Approach

Reliable automatic evaluation of dialogue systems under an interactive e...
research
04/01/2021

Residual Model Learning for Microrobot Control

A majority of microrobots are constructed using compliant materials that...

Please sign up or login with your details

Forgot password? Click here to reset