AugESC: Large-scale Data Augmentation for Emotional Support Conversation with Pre-trained Language Models

02/26/2022
by Chujie Zheng, et al.

Crowd-sourcing is commonly adopted for dialog data collection. However, it is costly and time-consuming, and the collected data is limited in scale and topic coverage. In this paper, aiming to generate emotional support conversations, we propose exploiting large-scale pre-trained language models for data augmentation and report key findings from our pilot exploration. Our approach leverages the 6B-parameter GPT-J model and uses publicly available dialog posts to trigger conversations on various topics. We then construct AugESC, a machine-augmented dataset for emotional support conversation. It is two orders of magnitude larger than the original ESConv dataset, covers more diverse topics, and is shown by human evaluation to be of high quality. Lastly, we demonstrate through interactive evaluation that AugESC can further enhance dialog models tuned on ESConv, enabling them to handle various conversation topics and to provide significantly more effective emotional support.

