The PhotoBook Dataset: Building Common Ground through Visually-Grounded Dialogue

06/04/2019
by Janosch Haber et al.

This paper introduces the PhotoBook dataset, a large-scale collection of visually-grounded, task-oriented dialogues in English designed to investigate how shared dialogue history accumulates during conversation. Taking inspiration from seminal work on dialogue analysis, we propose a data-collection task formulated as a collaborative game that prompts two online participants to refer to images utilising both their visual context and previously established referring expressions. We provide a detailed description of the task setup and a thorough analysis of the 2,500 dialogues collected. To further illustrate the novel features of the dataset, we propose a baseline model for reference resolution which uses a simple method to take into account shared information accumulated in a reference chain. Our results show that this information is particularly important for resolving later descriptions and underline the need to develop more sophisticated models of common ground in dialogue interaction.


