Quda: Natural Language Queries for Visual Data Analytics

05/07/2020
by Siwei Fu, et al.

Visualization-oriented natural language interfaces (V-NLIs) have been explored and developed in recent years. One challenge for V-NLIs is making effective design decisions, which usually requires a deep understanding of user queries. Learning-based approaches have shown potential in V-NLIs and have reached state-of-the-art performance in various NLP tasks. However, because training samples that cater to visual data analytics are scarce, cutting-edge techniques have rarely been employed in the development of V-NLIs. We present a new dataset, called Quda, to help V-NLIs understand free-form natural language. Our dataset contains 14,035 diverse user queries annotated with 10 low-level analytic tasks, which supports the deployment of state-of-the-art techniques for parsing complex human language. We build the dataset by first gathering seed queries from data analysts, who are the target users of V-NLIs. We then employ a large crowd workforce for paraphrase generation and validation. We demonstrate the usefulness of Quda in building V-NLIs by creating a prototype that makes effective design decisions for free-form user queries. We also show that Quda can benefit a wide range of applications in the visualization community by analyzing the design tasks described in academic publications.


research
08/24/2020

NL4DV: A Toolkit for Generating Analytic Specifications for Data Visualization from Natural Language Queries

Natural language interfaces (NLIs) have shown great promise for visual d...
research
09/08/2021

Towards Natural Language Interfaces for Data Visualization: A Survey

Utilizing Visualization-oriented Natural Language Interfaces (V-NLI) as ...
research
12/26/2022

Natural Language Interfaces to Data

Recent advances in NLU and NLP have resulted in renewed interest in natu...
research
12/24/2021

nvBench: A Large-Scale Synthesized Dataset for Cross-Domain Natural Language to Visualization Task

NL2VIS - which translates natural language (NL) queries to corresponding...
research
07/01/2022

Facilitating Conversational Interaction in Natural Language Interfaces for Visualization

Natural language (NL) toolkits enable visualization developers, who may ...
research
06/22/2019

Semantically Driven Auto-completion

The Bloomberg Terminal has been a leading source of financial data and a...
research
08/18/2020

VizCommender: Computing Text-Based Similarity in Visualization Repositories for Content-Based Recommendations

Cloud-based visualization services have made visual analytics accessible...
