ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format

11/30/2022
by   Qi Zhu, et al.

Diverse data formats and ontologies across task-oriented dialogue (TOD) datasets make it hard to develop general dialogue models that perform well on many datasets and to study knowledge transfer between datasets. To address this issue, we present ConvLab-3, a flexible dialogue system toolkit based on a unified TOD data format. In ConvLab-3, different datasets are transformed into one unified format and loaded by models in the same way. As a result, the cost of adapting a new model or dataset is significantly reduced. Compared to the previous releases of ConvLab (Lee et al., 2019b; Zhu et al., 2020b), ConvLab-3 allows developing dialogue systems with many more datasets and enhances the utility of the reinforcement learning (RL) toolkit for dialogue policies. To showcase the use of ConvLab-3 and inspire future work, we present a comprehensive study with various settings. We show the benefit of pre-training on other datasets for few-shot fine-tuning and RL, and encourage evaluating policies with diverse user simulators.
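
The idea of transforming differently structured datasets into one format, so that downstream code loads them in the same way, can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the two toy corpora, the field names (`dataset`, `dialogue_id`, `turns`, `speaker`, `utterance`), and the helpers `convert_toy_a`, `convert_toy_b`, `load_unified`, and `user_utterances` are hypothetical and are not ConvLab-3's actual schema or API.

```python
from typing import Any, Callable, Dict, List

# Hypothetical toy corpora with deliberately different native layouts.
RAW_TOY_A = [
    {"id": "a-0", "log": [
        {"who": "usr", "text": "I need a cheap hotel."},
        {"who": "sys", "text": "Which area do you prefer?"},
    ]},
]
RAW_TOY_B = [
    {"dialogue_id": "b-0",
     "utterances": ["Book a taxi to the station.", "What time?", "At 5 pm."]},
]

Dialogue = Dict[str, Any]


def convert_toy_a(raw: List[dict]) -> List[Dialogue]:
    """Map toy_a's per-turn speaker tags onto the shared (assumed) schema."""
    speaker = {"usr": "user", "sys": "system"}
    return [
        {"dataset": "toy_a",
         "dialogue_id": d["id"],
         "turns": [{"speaker": speaker[t["who"]], "utterance": t["text"]}
                   for t in d["log"]]}
        for d in raw
    ]


def convert_toy_b(raw: List[dict]) -> List[Dialogue]:
    """toy_b stores bare utterances; this sketch assumes the user speaks first."""
    return [
        {"dataset": "toy_b",
         "dialogue_id": d["dialogue_id"],
         "turns": [{"speaker": "user" if i % 2 == 0 else "system", "utterance": u}
                   for i, u in enumerate(d["utterances"])]}
        for d in raw
    ]


# One registry entry per dataset; adding a dataset means adding one converter.
CONVERTERS: Dict[str, Callable[[], List[Dialogue]]] = {
    "toy_a": lambda: convert_toy_a(RAW_TOY_A),
    "toy_b": lambda: convert_toy_b(RAW_TOY_B),
}


def load_unified(name: str) -> List[Dialogue]:
    """Return the named dataset in the shared format."""
    return CONVERTERS[name]()


def user_utterances(dialogues: List[Dialogue]) -> List[str]:
    """Dataset-agnostic consumer: it only relies on the shared schema."""
    return [t["utterance"]
            for d in dialogues
            for t in d["turns"]
            if t["speaker"] == "user"]


if __name__ == "__main__":
    for name in CONVERTERS:
        print(name, user_utterances(load_unified(name)))
```

In this sketch the dataset-specific logic lives only in the converters, so a consumer such as `user_utterances` never needs to change when a new dataset is registered. That is the sense in which a shared format lowers the cost of adapting a new model or dataset.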

research
08/14/2020

Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems

Task-oriented dialogue systems use four connected modules, namely, Natur...
research
07/19/2023

DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI

Despite advancements in conversational AI, language models encounter cha...
research
07/12/2019

Hello, It's GPT-2 -- How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems

Data scarcity is a long-standing and crucial challenge that hinders quic...
research
11/04/2022

MultiWOZ-DF – A Dataflow implementation of the MultiWOZ dataset

Semantic Machines (SM) have introduced the use of the dataflow (DF) para...
research
11/29/2017

A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management

Dialogue assistants are rapidly becoming an indispensable daily aid. To ...
research
05/04/2023

Re^3Dial: Retrieve, Reorganize and Rescale Dialogue Corpus for Long-Turn Open-Domain Dialogue Pre-training

Large-scale open-domain dialogue data crawled from public social media h...
research
07/25/2022

A Multi-Party Dialogue Ressource in French

We present Dialogues in Games (DinG), a corpus of manual transcriptions ...
