WYTIWYR: A User Intent-Aware Framework with Multi-modal Inputs for Visualization Retrieval

04/14/2023
by   Shishi Xiao, et al.
0

Retrieving charts from a large corpus is a fundamental task that can benefit numerous applications such as visualization recommendations.The retrieved results are expected to conform to both explicit visual attributes (e.g., chart type, colormap) and implicit user intents (e.g., design style, context information) that vary upon application scenarios. However, existing example-based chart retrieval methods are built upon non-decoupled and low-level visual features that are hard to interpret, while definition-based ones are constrained to pre-defined attributes that are hard to extend. In this work, we propose a new framework, namely WYTIWYR (What-You-Think-Is-What-You-Retrieve), that integrates user intents into the chart retrieval process. The framework consists of two stages: first, the Annotation stage disentangles the visual attributes within the bitmap query chart; and second, the Retrieval stage embeds the user's intent with customized text prompt as well as query chart, to recall targeted retrieval result. We develop a prototype WYTIWYR system leveraging a contrastive language-image pre-training (CLIP) model to achieve zero-shot classification, and test the prototype on a large corpus with charts crawled from the Internet. Quantitative experiments, case studies, and qualitative interviews are conducted. The results demonstrate the usability and effectiveness of our proposed framework.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 6

page 8

page 9

page 10

research
01/17/2023

Learning Customized Visual Models with Retrieval-Augmented Knowledge

Image-text contrastive learning models such as CLIP have demonstrated st...
research
02/12/2022

Structure-aware Visualization Retrieval

With the wide usage of data visualizations, a huge number of Scalable Ve...
research
07/31/2020

Retrieve-Then-Adapt: Example-based Automatic Generation for Proportion-related Infographics

Infographic is a data visualization technique which combines graphic and...
research
05/24/2023

Pre-training Intent-Aware Encoders for Zero- and Few-Shot Intent Classification

Intent classification (IC) plays an important role in task-oriented dial...
research
09/15/2022

Exploring Visual Interpretability for Contrastive Language-Image Pre-training

Contrastive Language-Image pre-training (CLIP) learns rich representatio...
research
09/13/2018

Interpreting search result rankings through intent modeling

Given the recent interest in arguably accurate yet non-interpretable neu...
research
12/13/2015

An Uncertainty-Aware Approach for Exploratory Microblog Retrieval

Although there has been a great deal of interest in analyzing customer o...

Please sign up or login with your details

Forgot password? Click here to reset