A Student-Teacher Architecture for Dialog Domain Adaptation under the Meta-Learning Setting

04/06/2021
by   Kun Qian, et al.
0

Numerous new dialog domains are being created every day while collecting data for these domains is extremely costly since it involves human interactions. Therefore, it is essential to develop algorithms that can adapt to different domains efficiently when building data-driven dialog models. The most recent researches on domain adaption focus on giving the model a better initialization, rather than optimizing the adaptation process. We propose an efficient domain adaptive task-oriented dialog system model, which incorporates a meta-teacher model to emphasize the different impacts between generated tokens with respect to the context. We first train our base dialog model and meta-teacher model adversarially in a meta-learning setting on rich-resource domains. The meta-teacher learns to quantify the importance of tokens under different contexts across different domains. During adaptation, the meta-teacher guides the dialog model to focus on important tokens in order to achieve better adaptation efficiency. We evaluate our model on two multi-domain datasets, MultiWOZ and Google Schema-Guided Dialogue, and achieve state-of-the-art performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/08/2019

Domain Adaptive Dialog Generation via Meta Learning

Domain adaptation is an essential task in dialog system building because...
research
01/29/2021

Few-Shot Domain Adaptation for Grammatical Error Correction via Meta-Learning

Most existing Grammatical Error Correction (GEC) methods based on sequen...
research
02/22/2021

Domain Adaptation in Dialogue Systems using Transfer and Meta-Learning

Current generative-based dialogue systems are data-hungry and fail to ad...
research
06/03/2020

Meta Dialogue Policy Learning

Dialog policy determines the next-step actions for agents and hence is c...
research
12/23/2016

A Base Camp for Scaling AI

Modern statistical machine learning (SML) methods share a major limitati...
research
09/07/2023

Learning from Limited Heterogeneous Training Data: Meta-Learning for Unsupervised Zero-Day Web Attack Detection across Web Domains

Recently unsupervised machine learning based systems have been developed...
research
07/11/2023

OntoChatGPT Information System: Ontology-Driven Structured Prompts for ChatGPT Meta-Learning

This research presents a comprehensive methodology for utilizing an onto...

Please sign up or login with your details

Forgot password? Click here to reset