MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation

11/10/2022
by   Jiazhan Feng, et al.
0

Responding with multi-modal content has been recognized as an essential capability for an intelligent conversational agent. In this paper, we introduce the MMDialog dataset to better facilitate multi-modal conversation. MMDialog is composed of a curated set of 1.08 million real-world dialogues with 1.53 million unique images across 4,184 topics. MMDialog has two main and unique advantages. First, it is the largest multi-modal conversation dataset by the number of dialogues by 88x. Second, it contains massive topics to generalize the open-domain. To build engaging dialogue system with this dataset, we propose and normalize two response producing tasks based on retrieval and generative scenarios. In addition, we build two baselines for above tasks with state-of-the-art techniques and report their experimental performance. We also propose a novel evaluation metric MM-Relevance to measure the multi-modal responses. Our dataset and scripts are available in https://github.com/victorsungo/MMDialog.

READ FULL TEXT

page 1

page 13

research
09/27/2021

OpenViDial 2.0: A Larger-Scale, Open-Domain Dialogue Generation Dataset with Visual Contexts

In order to better simulate the real human conversation process, models ...
research
08/31/2023

AntM^2C: A Large Scale Dataset For Multi-Scenario Multi-Modal CTR Prediction

Click-through rate (CTR) prediction is a crucial issue in recommendation...
research
05/23/2022

META-GUI: Towards Multi-modal Conversational Agents on Mobile GUI

Task-oriented dialogue (TOD) systems have been widely used by mobile pho...
research
07/03/2023

JourneyDB: A Benchmark for Generative Image Understanding

While recent advancements in vision-language models have revolutionized ...
research
12/16/2022

On Safe and Usable Chatbots for Promoting Voter Participation

Chatbots, or bots for short, are multi-modal collaborative assistants th...
research
03/26/2018

On Chatbots Exhibiting Goal-Directed Autonomy in Dynamic Environments

Conversation interfaces (CIs), or chatbots, are a popular form of intell...
research
08/16/2021

MMChat: Multi-Modal Chat Dataset on Social Media

Incorporating multi-modal contexts in conversation is an important step ...

Please sign up or login with your details

Forgot password? Click here to reset