MPCHAT: Towards Multimodal Persona-Grounded Conversation

05/27/2023
by Jaewoo Ahn, et al.

To build self-consistent personalized dialogue agents, previous research has mostly focused on textual persona that conveys personal facts or personalities. However, to fully describe the multi-faceted nature of persona, the image modality can help better reveal the speaker's personal characteristics and experiences in episodic memory (Rubin et al., 2003; Conway, 2009). In this work, we extend persona-based dialogue to the multimodal domain and make two main contributions. First, we present MPCHAT, the first multimodal persona-based dialogue dataset, which extends persona with both text and images to contain episodic memories. Second, we empirically show that incorporating multimodal persona leads to statistically significant performance improvements across all three proposed multimodal persona-grounded dialogue tasks: next response prediction, grounding persona prediction, and speaker identification. Thus, our work highlights that multimodal persona is crucial for improving multimodal dialogue comprehension, and MPCHAT serves as a high-quality resource for this research.

research
10/20/2018

Improving Context Modelling in Multimodal Dialogue Generation

In this work, we investigate the task of textual response generation in ...
research
10/20/2018

A Knowledge-Grounded Multimodal Search-Based Conversational Agent

Multimodal search-based dialogue is a challenging new task: It extends v...
research
04/04/2020

Open Domain Dialogue Generation with Latent Images

We consider grounding open domain dialogues with images. Existing work a...
research
07/25/2022

A Multi-Party Dialogue Ressource in French

We present Dialogues in Games (DinG), a corpus of manual transcriptions ...
research
09/10/2023

Collecting Visually-Grounded Dialogue with A Game Of Sorts

An idealized, though simplistic, view of the referring expression produc...
research
02/28/2023

Which One Are You Referring To? Multimodal Object Identification in Situated Dialogue

The demand for multimodal dialogue systems has been rising in various do...
research
02/09/2023

Improving the Generalizability of Collaborative Dialogue Analysis with Multi-Feature Embeddings

Conflict prediction in communication is integral to the design of virtua...
