Post-processing Networks: Method for Optimizing Pipeline Task-oriented Dialogue Systems using Reinforcement Learning

07/25/2022
by   Atsumoto Ohashi, et al.
0

Many studies have proposed methods for optimizing the dialogue performance of an entire pipeline task-oriented dialogue system by jointly training modules in the system using reinforcement learning. However, these methods are limited in that they can only be applied to modules implemented using trainable neural-based methods. To solve this problem, we propose a method for optimizing a pipeline system composed of modules implemented with arbitrary methods for dialogue performance. With our method, neural-based components called post-processing networks (PPNs) are installed inside such a system to post-process the output of each module. All PPNs are updated to improve the overall dialogue performance of the system by using reinforcement learning, not necessitating each module to be differentiable. Through dialogue simulation and human evaluation on the MultiWOZ dataset, we show that our method can improve the dialogue performance of pipeline systems consisting of various modules.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/21/2020

Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue Systems

Dialogue policy learning for task-oriented dialogue systems has enjoyed ...
research
06/03/2011

Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System

Designing the dialogue policy of a spoken dialogue system involves many ...
research
05/12/2019

Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations

Assemblies of modular subsystems are being pressed into service to perfo...
research
06/08/2016

Continuously Learning Neural Dialogue Management

We describe a two-step approach for dialogue management in task-oriented...
research
03/03/2017

End-to-End Task-Completion Neural Dialogue Systems

One of the major drawbacks of modularized task-completion dialogue syste...
research
10/25/2021

Findings from Experiments of On-line Joint Reinforcement Learning of Semantic Parser and Dialogue Manager with real Users

Design of dialogue systems has witnessed many advances lately, yet acqui...
research
10/18/2022

Team Flow at DRC2022: Pipeline System for Travel Destination Recommendation Task in Spoken Dialogue

To improve the interactive capabilities of a dialogue system, e.g., to a...

Please sign up or login with your details

Forgot password? Click here to reset