UniPCM: Universal Pre-trained Conversation Model with Task-aware Automatic Prompt

09/20/2023
by   Yucheng Cai, et al.
0

Recent research has shown that multi-task pre-training greatly improves the model's robustness and transfer ability, which is crucial for building a high-quality dialog system. However, most previous works on multi-task pre-training rely heavily on human-defined input format or prompt, which is not optimal in quality and quantity. In this work, we propose to use Task-based Automatic Prompt generation (TAP) to automatically generate high-quality prompts. Using the high-quality prompts generated, we scale the corpus of the pre-trained conversation model to 122 datasets from 15 dialog-related tasks, resulting in Universal Pre-trained Conversation Model (UniPCM), a powerful foundation model for various conversational tasks and different dialog systems. Extensive experiments have shown that UniPCM is robust to input prompts and capable of various dialog-related tasks. Moreover, UniPCM has strong transfer ability and excels at low resource scenarios, achieving SOTA results on 9 different datasets ranging from task-oriented dialog to open-domain conversation. Furthermore, we are amazed to find that TAP can generate prompts on par with those collected with crowdsourcing. The code is released with the paper.

READ FULL TEXT

page 3

page 5

research
04/24/2020

A Tailored Pre-Training Model for Task-Oriented Dialog Generation

The recent success of large pre-trained language models such as BERT and...
research
11/29/2021

GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning and Explicit Policy Injection

Pre-trained models have proved to be powerful in enhancing task-oriented...
research
06/29/2023

UMASS_BioNLP at MEDIQA-Chat 2023: Can LLMs generate high-quality synthetic note-oriented doctor-patient conversations?

This paper presents UMASS_BioNLP team participation in the MEDIQA-Chat 2...
research
05/20/2021

Towards Detecting Need for Empathetic Response in Motivational Interviewing

Empathetic response from the therapist is key to the success of clinical...
research
02/26/2022

AugESC: Large-scale Data Augmentation for Emotional Support Conversation with Pre-trained Language Models

Crowd-sourcing is commonly adopted for dialog data collection. However, ...
research
12/15/2021

Database Search Results Disambiguation for Task-Oriented Dialog Systems

As task-oriented dialog systems are becoming increasingly popular in our...
research
05/23/2023

Effortless Integration of Memory Management into Open-Domain Conversation Systems

Open-domain conversation systems integrate multiple conversation skills ...

Please sign up or login with your details

Forgot password? Click here to reset