Words of Estimative Correlation: Studying Verbalizations of Scatterplots

11/28/2019
by   Rafael Henkin, et al.
0

Multimodal approaches where interactive visualization and natural language are used in tandem are emerging as promising techniques for data analysis. A significant barrier in effectively designing such multimodal techniques is the lack of a systematic understanding of how people verbalize visual representations of data. Motivated by these gaps, this paper devises and applies a transferable, semi-automated methodology to systematically study the relation between visualization and natural language through two crowd-sourced experiments and natural language analysis. We describe these experiments, analyze the resulting corpus of utterances with natural language processing techniques and derive an empirically supported semantic lexicon for aligning visualizations and verbalizations of data. Our results indicate a wide range of vocabulary used to describe visualizations and led to a number of high level concepts to categorize the space of words and related utterances. We discuss how our findings can be used for natural language generation, also reflecting on the design of the experiments and the semi-automated methodology used in the analysis. We discuss further research directions and argue for a role for such multimodal experiments in advancing our understanding of how people work with visualizations and also data at large.

READ FULL TEXT

page 5

page 8

page 10

page 12

research
10/01/2021

Collecting and Characterizing Natural Language Utterances for Specifying Data Visualizations

Natural language interfaces (NLIs) for data visualization are becoming i...
research
09/29/2021

Reflexivity in Issues of Scale and Representation in a Digital Humanities Project

In this paper, we explore issues that we have encountered in developing ...
research
08/07/2022

Comparison Conundrum and the Chamber of Visualizations: An Exploration of How Language Influences Visual Design

The language for expressing comparisons is often complex and nuanced, ma...
research
08/24/2020

NL4DV: A Toolkit for Generating Analytic Specifications for Data Visualization from Natural Language Queries

Natural language interfaces (NLIs) have shown great promise for visual d...
research
01/17/2020

InChorus: Designing Consistent Multimodal Interactions for Data Visualization on Tablet Devices

While tablet devices are a promising platform for data visualization, su...
research
10/08/2021

Accessible Visualization via Natural Language Descriptions: A Four-Level Model of Semantic Content

Natural language descriptions sometimes accompany visualizations to bett...
research
07/20/2022

Probablement, Wahrscheinlich, Likely ? A Cross-Language Study of How People Verbalize Probabilities in Icon Array Visualizations

Visualizations today are used across a wide range of languages and cultu...

Please sign up or login with your details

Forgot password? Click here to reset