EM Pre-training for Multi-party Dialogue Response Generation

05/21/2023
by Yiyang Li, et al.

Dialogue response generation requires an agent to generate a response conditioned on the current dialogue history. Two-party dialogues have been well studied in this setting, but multi-party dialogues remain largely unexplored. Unlike two-party dialogues, where each response directly replies to the previous utterance, a multi-party dialogue requires the addressee of a response to be specified before the response is generated. Thanks to the large amount of two-party conversational data, various pre-trained language models for two-party dialogue response generation have been proposed. However, multi-party dialogue datasets mostly lack annotated addressee labels, which makes it hard to use them to pre-train a response generation model for multi-party dialogues. To overcome this obstacle, we propose an Expectation-Maximization (EM) approach that alternates between expectation steps, which generate addressee labels, and maximization steps, which optimize a response generation model. Theoretical analyses and extensive experiments support the feasibility and effectiveness of the proposed method.
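The alternation described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a hard-EM variant (each E-step picks a single most likely addressee instead of keeping a full posterior), replaces the neural response generator with a smoothed word co-occurrence model, and assumes every utterance is non-empty. All function names (`m_step`, `e_step`, `log_likelihood`, `em_pretrain`) are hypothetical.

```python
import math
from collections import defaultdict


def m_step(dialogues, addressees, smoothing=0.1):
    """M-step (toy): refit the response model on (addressee utterance, response) pairs."""
    counts = defaultdict(lambda: defaultdict(float))
    vocab = set()
    for (history, response), z in zip(dialogues, addressees):
        for r_word in response.split():
            vocab.add(r_word)
            for h_word in history[z].split():
                counts[h_word][r_word] += 1.0
    return counts, vocab, smoothing


def log_likelihood(model, utterance, response):
    """Log-probability of a response given one candidate addressee utterance."""
    counts, vocab, smoothing = model
    h_words = utterance.split()
    total = 0.0
    for r_word in response.split():
        p = 0.0
        for h_word in h_words:
            row = counts.get(h_word, {})
            denom = sum(row.values()) + smoothing * len(vocab)
            p += (row.get(r_word, 0.0) + smoothing) / denom
        total += math.log(p / len(h_words))
    return total


def e_step(model, dialogues):
    """E-step (hard, assumed): label each response with its most likely addressee index."""
    return [
        max(range(len(history)),
            key=lambda z: log_likelihood(model, history[z], response))
        for history, response in dialogues
    ]


def em_pretrain(dialogues, iterations=3):
    """Alternate E- and M-steps, starting from the two-party assumption
    that each response replies to the immediately preceding utterance."""
    addressees = [len(history) - 1 for history, _ in dialogues]
    model = m_step(dialogues, addressees)
    for _ in range(iterations):
        addressees = e_step(model, dialogues)
        model = m_step(dialogues, addressees)
    return model, addressees


# Each dialogue is (list of history utterances, response); no addressee labels given.
dialogues = [
    (["how do i install the driver", "anyone seen the game last night"],
     "run the installer to install the driver"),
    (["my wifi keeps dropping"], "try restarting the wifi router"),
]
model, addressees = em_pretrain(dialogues)
```

Here `addressees[i]` is the inferred index into the history of dialogue `i`. In the setting the abstract describes, the count-based model would be a pre-trained generator and the E-step would be driven by that generator's own likelihoods.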


research
05/24/2023

Pre-training Multi-party Dialogue Models with Latent Discourse Inference

Multi-party dialogues are more difficult for models to understand than o...
research
09/29/2022

An Equal-Size Hard EM Algorithm for Diverse Dialogue Generation

Open-domain dialogue systems aim to interact with humans through natural...
research
05/03/2017

A Hybrid Architecture for Multi-Party Conversational Systems

Multi-party Conversational Systems are systems with natural language int...
research
05/31/2019

GSN: A Graph-Structured Network for Multi-Party Dialogues

Existing neural models for dialogue response generation assume that utte...
research
04/29/2020

Utterance Pair Scoring for Noisy Dialogue Data Filtering

Filtering noisy training data is one of the key approaches to improving ...
research
05/22/2023

MADNet: Maximizing Addressee Deduction Expectation for Multi-Party Conversation Generation

Modeling multi-party conversations (MPCs) with graph neural networks has...
research
09/29/2022

ConceptNet infused DialoGPT for Underlying Commonsense Understanding and Reasoning in Dialogue Response Generation

The pre-trained conversational models still fail to capture the implicit...
