Toward Self-Learning End-to-End Dialog Systems

01/18/2022
by   Xiaoying Zhang, et al.
0

End-to-end task-oriented dialog systems often suffer from out-of-distribution (OOD) inputs after being deployed in dynamic, changing, and open environments. In this work, we propose SL-Agent, a self-learning framework that combines supervised learning, reinforcement learning, and machine teaching for building end-to-end dialog systems in a more realistic changing environment setting. SL-Agent consists of a dialog model and a pre-trained reward model to judge the quality of a system response. SL-Agent enables dialog agents to automatically adapt to environments with user behavior changes by learning from human-bot interactions via reinforcement learning, with the incorporated pre-trained reward model. We validate SL-Agent in four different dialog domains. Experimental results show the effectiveness of SL-Agent for automatically adapting to changing environments using both automatic and human evaluations. Furthermore, experiments on a challenging domain extension setting demonstrate that SL-Agent can effectively adapt to new tasks using limited human corrections provided via machine teaching. We will release code, data, and pre-trained models for further research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/11/2020

SOLOIST: Few-shot Task-Oriented Dialog with A Single Pre-trained Auto-regressive Model

This paper presents a new method SOLOIST, which uses transfer learning t...
research
04/18/2019

ConvLab: Multi-Domain End-to-End Dialog System Platform

We present ConvLab, an open-source multi-domain end-to-end dialog system...
research
09/20/2021

Two Approaches to Building Collaborative, Task-Oriented Dialog Agents through Self-Play

Task-oriented dialog systems are often trained on human/human dialogs, s...
research
04/28/2018

Sentiment Adaptive End-to-End Dialog Systems

End-to-end learning framework is useful for building dialog systems for ...
research
10/21/2021

SYNERGY: Building Task Bots at Scale Using Symbolic Knowledge and Machine Teaching

In this paper we explore the use of symbolic knowledge and machine teach...
research
05/13/2020

Towards Automatic building of Human-Machine Conversational System to support Maintenance Processes

Companies are dealing with many cognitive changes with the introduction ...
research
07/17/2019

Learning End-to-End Goal-Oriented Dialog with Maximal User Task Success and Minimal Human Agent Use

Neural end-to-end goal-oriented dialog systems showed promise to reduce ...

Please sign up or login with your details

Forgot password? Click here to reset