Advancing Semi-Supervised Task Oriented Dialog Systems by JSA Learning of Discrete Latent Variable Models

07/25/2022
by Yucheng Cai, et al.

Developing semi-supervised task-oriented dialog (TOD) systems by leveraging unlabeled dialog data has attracted increasing interest. For semi-supervised learning of latent state TOD models, variational learning is often used, but it suffers from the high variance of the gradients propagated through discrete latent variables and from the drawback of only indirectly optimizing the target log-likelihood. Recently, an alternative algorithm called joint stochastic approximation (JSA) has emerged for learning discrete latent variable models, with impressive performance. In this paper, we propose to apply JSA to semi-supervised learning of latent state TOD models, an approach we refer to as JSA-TOD. To our knowledge, JSA-TOD is the first work to develop JSA-based semi-supervised learning of discrete latent variable conditional models for long sequential generation problems such as those in TOD systems. Extensive experiments show that JSA-TOD significantly outperforms its variational learning counterpart. Remarkably, semi-supervised JSA-TOD using 20 labels performs close to the fully supervised baseline on MultiWOZ2.1.


