InfiniViz: Interactive Visual Exploration using Progressive Bin Refinement

10/05/2017
by   Niranjan Kamat, et al.
0

Interactive visualizations can accelerate the data analysis loop through near-instantaneous feedback. To achieve interactivity, techniques such as data cubes and sampling are typically employed. While data cubes can speedup querying for moderate-sized datasets, they are ineffective at doing so at a larger scales due to the size of the materialized data cubes. On the other hand, while sampling can help scale to large datasets, it adds sampling error and the associated issues into the process. While increasing accuracy by looking at more data may sometimes be valuable, providing result minutiae might not be necessary if they do not impart additional significant information. Indeed, such details not only incur a higher computational cost, but also tax the cognitive load of the analyst with worthless trivia. To reduce both the computational and cognitive expenses, we introduce InfiniViz. Through a novel result refinement-based querying paradigm, InfiniViz provides error-free results for large datasets by increasing bin resolutions progressively over time. Through real and simulated workloads over real and benchmark datasets, we evaluate and demonstrate InfiniViz's utility at reducing both cognitive and computational costs, while minimizing information loss.

READ FULL TEXT
research
09/28/2020

Using Resource-Rational Analysis to Understand Cognitive Biases in Interactive Data Visualizations

Cognitive biases are systematic errors in judgment. Researchers in data ...
research
05/12/2019

Kyrix: Interactive Visual Data Exploration at Scale

Scalable interactive visual data exploration is crucial in many domains ...
research
08/29/2020

STULL: Unbiased Online Sampling for Visual Exploration of Large Spatiotemporal Data

Online sampling-supported visual analytics is increasingly important, as...
research
07/10/2019

Hillview: A trillion-cell spreadsheet for big data

Hillview is a distributed spreadsheet for browsing very large datasets t...
research
07/20/2023

Investigating minimizing the training set fill distance in machine learning regression

Many machine learning regression methods leverage large datasets for tra...
research
09/12/2018

Investigating the generalizability of EEG-based Cognitive Load Estimation Across Visualizations

We examine if EEG-based cognitive load (CL) estimation is generalizable ...
research
08/17/2022

Sampling Through the Lens of Sequential Decision Making

Sampling is ubiquitous in machine learning methodologies. Due to the gro...

Please sign up or login with your details

Forgot password? Click here to reset