Graph Neural Network Policies and Imitation Learning for Multi-Domain Task-Oriented Dialogues

10/11/2022
by   Thibault Cordier, et al.
0

Task-oriented dialogue systems are designed to achieve specific goals while conversing with humans. In practice, they may have to handle simultaneously several domains and tasks. The dialogue manager must therefore be able to take into account domain changes and plan over different domains/tasks in order to deal with multidomain dialogues. However, learning with reinforcement in such context becomes difficult because the state-action dimension is larger while the reward signal remains scarce. Our experimental results suggest that structured policies based on graph neural networks combined with different degrees of imitation learning can effectively handle multi-domain dialogues. The reported experiments underline the benefit of structured policies over standard policies.

READ FULL TEXT

page 9

page 10

research
04/18/2018

Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems

In this work, we present a hybrid learning method for training task-orie...
research
02/22/2023

Few-Shot Structured Policy Learning for Multi-Domain and Multi-Task Dialogues

Reinforcement learning has been widely adopted to model dialogue manager...
research
02/07/2020

I love your chain mail! Making knights smile in a fantasy game world: Open-domain goal-oriented dialogue agents

Dialogue research tends to distinguish between chit-chat and goal-orient...
research
02/07/2020

I love your chain mail! Making knights smile in a fantasy game world: Open-domain goal-orientated dialogue agents

Dialogue research tends to distinguish between chit-chat and goal-orient...
research
12/09/2018

Dialogue Generation: From Imitation Learning to Inverse Reinforcement Learning

The performance of adversarial dialogue generation models relies on the ...
research
05/06/2023

Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization

Policy learning (PL) is a module of a task-oriented dialogue system that...
research
01/07/2020

Attention over Parameters for Dialogue Systems

Dialogue systems require a great deal of different but complementary exp...

Please sign up or login with your details

Forgot password? Click here to reset