Representation Learning for Conversational Data using Discourse Mutual Information Maximization

12/04/2021
by   Bishal Santra, et al.
0

Although many pretrained models exist for text or images, there have been relatively fewer attempts to train representations specifically for dialog understanding. Prior works usually relied on finetuned representations based on generic text representation models like BERT or GPT-2. But, existing pretraining objectives do not take the structural information of text into consideration. Although generative dialog models can learn structural features too, we argue that the structure-unaware word-by-word generation is not suitable for effective conversation modeling. We empirically demonstrate that such representations do not perform consistently across various dialog understanding tasks. Hence, we propose a structure-aware Mutual Information based loss-function DMI (Discourse Mutual Information) for training dialog-representation models, that additionally captures the inherent uncertainty in response prediction. Extensive evaluation on nine diverse dialog modeling tasks shows that our proposed DMI-based models outperform strong baselines by significant margins, even with small-scale pretraining. Our models show the most promising performance on the dialog evaluation task DailyDialog++, in both random and adversarial negative scenarios.

READ FULL TEXT
research
06/02/2019

Pretraining Methods for Dialog Context Representation Learning

This paper examines various unsupervised pretraining objectives for lear...
research
03/09/2020

Matching Text with Deep Mutual Information Estimation

Text matching is a core natural language processing research problem. Ho...
research
08/20/2018

Learning deep representations by mutual information estimation and maximization

Many popular representation-learning algorithms use training objectives ...
research
10/10/2022

Transformer-based Localization from Embodied Dialog with Large-scale Pre-training

We address the challenging task of Localization via Embodied Dialog (LED...
research
10/25/2021

Persona Authentication through Generative Dialogue

In this paper we define and investigate the problem of persona authentic...
research
10/05/2020

Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine Reading

Document interpretation and dialog understanding are the two major chall...
research
04/01/2022

Learning Disentangled Representations of Negation and Uncertainty

Negation and uncertainty modeling are long-standing tasks in natural lan...

Please sign up or login with your details

Forgot password? Click here to reset