Reference Resolution and Context Change in Multimodal Situated Dialogue for Exploring Data Visualizations

09/06/2022
by   Barbara Di Eugenio, et al.
1

Reference resolution, which aims to identify entities being referred to by a speaker, is more complex in real world settings: new referents may be created by processes the agents engage in and/or be salient only because they belong to the shared physical setting. Our focus is on resolving references to visualizations on a large screen display in multimodal dialogue; crucially, reference resolution is directly involved in the process of creating new visualizations. We describe our annotations for user references to visualizations appearing on a large screen via language and hand gesture and also new entity establishment, which results from executing the user request to create a new visualization. We also describe our reference resolution pipeline which relies on an information-state architecture to maintain dialogue context. We report results on detecting and resolving references, effectiveness of contextual information on the model, and under-specified requests for creating visualizations. We also experiment with conventional CRF and deep learning / transformer models (BiLSTM-CRF and BERT-CRF) for tagging references in user utterance text. Our results show that transfer learning significantly boost performance of the deep learning methods, although CRF still out-performs them, suggesting that conventional methods may generalize better for low resource data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/06/2020

Speaker-change Aware CRF for Dialogue Act Classification

Recent work in Dialogue Act (DA) classification approaches the task as a...
research
11/09/2020

Refer, Reuse, Reduce: Generating Subsequent References in Visual and Conversational Contexts

Dialogue participants often refer to entities or situations repeatedly w...
research
05/24/2022

Scoring Coreference Chains with Split-Antecedent Anaphors

Anaphoric reference is an aspect of language interpretation covering a v...
research
06/26/2014

Communicating and resolving entity references

Statements about entities occur everywhere, from newspapers and web page...
research
09/28/2011

Cognitive Principles in Robust Multimodal Interpretation

Multimodal conversational interfaces provide a natural means for users t...
research
06/14/2023

Chart2Vec: A Universal Embedding of Context-Aware Visualizations

The advances in AI-enabled techniques have accelerated the creation and ...

Please sign up or login with your details

Forgot password? Click here to reset