Foresight: Rapid Data Exploration Through Guideposts

09/29/2017
by   Çağatay Demiralp, et al.

Current tools for exploratory data analysis (EDA) require users to manually select data attributes, statistical computations and visual encodings. This can be daunting for large-scale, complex data. We introduce Foresight, a visualization recommender system that helps the user rapidly explore large high-dimensional datasets through "guideposts." A guidepost is a visualization corresponding to a pronounced instance of a statistical descriptor of the underlying data, such as a strong linear correlation between two attributes, high skewness or concentration about the mean of a single attribute, or a strong clustering of values. For each descriptor, Foresight initially presents visualizations of the "strongest" instances, based on an appropriate ranking metric. Given these initial guideposts, the user can then look at "nearby" guideposts by issuing "guidepost queries" containing constraints on metric type, metric strength, data attributes, and data values. Thus, the user can directly explore the network of guideposts, rather than the overwhelming space of data attributes and visual encodings. Foresight also provides for each descriptor a global visualization of ranking-metric values to both help orient the user and ensure a thorough exploration process. Foresight facilitates interactive exploration of large datasets using fast, approximate sketching to compute ranking metrics. We also contribute insights on EDA practices of data scientists, summarizing results from an interview study we conducted to inform the design of Foresight.
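The guidepost idea in the abstract can be illustrated with a minimal sketch. It is not the Foresight implementation: it computes exact Pearson correlation and moment-based skewness in plain Python, whereas the abstract states that Foresight uses fast, approximate sketching. The function names (`pearson`, `skewness`, `rank_correlation_guideposts`, `guidepost_query`) and the sample table are hypothetical and chosen only for illustration.

```python
import math
from itertools import combinations


def pearson(xs, ys):
    """Exact Pearson correlation of two equal-length numeric lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0 or sy == 0:
        return 0.0  # a constant attribute has no defined correlation
    return cov / (sx * sy)


def skewness(xs):
    """Moment-based (population) skewness of a numeric list."""
    n = len(xs)
    m = sum(xs) / n
    m2 = sum((x - m) ** 2 for x in xs) / n
    m3 = sum((x - m) ** 3 for x in xs) / n
    return 0.0 if m2 == 0 else m3 / m2 ** 1.5


def rank_correlation_guideposts(table):
    """Rank all attribute pairs by |correlation|, strongest first."""
    scored = [
        ((a, b), abs(pearson(table[a], table[b])))
        for a, b in combinations(sorted(table), 2)
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)


def guidepost_query(ranked, attribute=None, min_strength=0.0):
    """Filter ranked guideposts by attribute and metric strength
    (an assumed, simplified stand-in for a "guidepost query")."""
    return [
        (attrs, strength)
        for attrs, strength in ranked
        if strength >= min_strength
        and (attribute is None or attribute in attrs)
    ]


# Hypothetical sample data: column name -> values.
table = {
    "height": [1, 2, 3, 4, 5],
    "weight": [2, 4, 6, 8, 10.5],
    "noise": [3, 1, 4, 1, 5],
    "income": [1, 1, 1, 2, 20],
}

ranked = rank_correlation_guideposts(table)
strongest = ranked[:3]                                # initial guideposts
nearby = guidepost_query(ranked, attribute="income")  # "nearby" guideposts
skew_ranking = sorted(table, key=lambda c: abs(skewness(table[c])), reverse=True)
```

Here `strongest` plays the role of the initial guideposts for the correlation descriptor, and `guidepost_query` narrows the ranking to guideposts that involve a chosen attribute. A real system would map each ranked entry to a visualization such as a scatter plot or histogram.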

research
07/12/2017

Foresight: Recommending Visual Insights

Current tools for exploratory data analysis (EDA) require users to manua...
research
04/09/2018

Clustrophile 2: Guided Visual Clustering Analysis

Data clustering is a common unsupervised learning method frequently used...
research
08/29/2022

SemanticAxis: Exploring Multi-attribute Data by Semantics Construction and Ranking Analysis

Mining the distribution of features and sorting items by combined attrib...
research
12/10/2017

Exploration of User Groups in VEXUS

We introduce VEXUS, an interactive visualization framework for exploring...
research
08/06/2021

Lumos: Increasing Awareness of Analytic Behavior during Visual Data Analysis

Visual data analysis tools provide people with the agency and flexibilit...
research
07/30/2020

Composition and Configuration Patterns in Multiple-View Visualizations

Multiple-view visualization (MV) is a layout design technique often empl...
research
10/10/2019

Visual Understanding of Multiple Attributes Learning Model of X-Ray Scattering Images

This extended abstract presents a visualization system, which is designe...
