STULL: Unbiased Online Sampling for Visual Exploration of Large Spatiotemporal Data

08/29/2020
by   Guizhen Wang, et al.

Online sampling-supported visual analytics is increasingly important, as it allows users to explore large datasets with acceptable approximate answers at interactive rates. However, existing online spatiotemporal sampling techniques are often biased, as most researchers have primarily focused on reducing computational latency. Biased sampling approaches select data with unequal probabilities and produce results that do not match the exact data distribution, leading end users to incorrect interpretations. In this paper, we propose a novel approach to perform unbiased online sampling of large spatiotemporal data. The proposed approach ensures the same probability of selection for every point that satisfies the specifications of a user's multidimensional query. To achieve unbiased sampling for accurate, representative interactive visualizations, we design a novel data index and an associated sample retrieval plan. Our proposed sampling approach is suitable for a wide variety of visual analytics tasks, e.g., tasks that run aggregate queries of spatiotemporal data. Extensive experiments confirm the superiority of our approach over a state-of-the-art spatial online sampling technique, demonstrating that within the same computational time, data samples generated by our approach are at least 50% more accurate in representing the actual spatial distribution of the data and enable approximate visualizations to present closer visual appearances to the exact ones.
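
To make the unbiasedness property concrete, the sketch below draws a sample in which every point satisfying a spatiotemporal range query has the same selection probability. It is an illustration only, not the STULL index or retrieval plan described in the paper: it uses classic reservoir sampling over a linear scan, and the names `in_query` and `uniform_sample`, as well as the (x, y, t) point layout, are assumptions made for this example.

```python
import random
from typing import Iterable, List, Tuple

Point = Tuple[float, float, float]  # hypothetical layout: (x, y, t)
Range = Tuple[float, float]         # inclusive (low, high)


def in_query(p: Point, x_range: Range, y_range: Range, t_range: Range) -> bool:
    """Return True if the point lies inside the query box (inclusive bounds)."""
    x, y, t = p
    return (x_range[0] <= x <= x_range[1]
            and y_range[0] <= y <= y_range[1]
            and t_range[0] <= t <= t_range[1])


def uniform_sample(points: Iterable[Point], k: int,
                   x_range: Range, y_range: Range, t_range: Range,
                   rng: random.Random) -> List[Point]:
    """Reservoir sampling (Algorithm R) restricted to qualifying points.

    Each qualifying point ends up in the result with equal probability.
    If fewer than k points qualify, all of them are returned.
    """
    reservoir: List[Point] = []
    seen = 0  # number of qualifying points encountered so far
    for p in points:
        if not in_query(p, x_range, y_range, t_range):
            continue
        seen += 1
        if len(reservoir) < k:
            reservoir.append(p)
        else:
            # Keep the new point with probability k / seen.
            j = rng.randrange(seen)
            if j < k:
                reservoir[j] = p
    return reservoir
```

A linear scan like this is unbiased but touches every point, so it does not address latency; the abstract's index and sample retrieval plan are aimed at obtaining the same equal-probability guarantee at interactive rates.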


research
08/10/2022

A Comparison of Spatiotemporal Visualizations for 3D Urban Analytics

Recent technological innovations have led to an increase in the availabi...
research
11/15/2018

Model-based Approximate Query Processing

Interactive visualizations are arguably the most important tool to explo...
research
07/26/2019

SCATTERSEARCH: Visual Querying of Scatterplot Visualizations

Scatterplots are one of the simplest and most commonly-used visualizatio...
research
10/05/2017

InfiniViz: Interactive Visual Exploration using Progressive Bin Refinement

Interactive visualizations can accelerate the data analysis loop through...
research
05/12/2019

Kyrix: Interactive Visual Data Exploration at Scale

Scalable interactive visual data exploration is crucial in many domains ...
research
12/17/2019

Mosaic: A Sample-Based Database System for Open World Query Processing

Data scientists have relied on samples to analyze populations of interes...
research
11/22/2022

BASM: A Bottom-up Adaptive Spatiotemporal Model for Online Food Ordering Service

Online Food Ordering Service (OFOS) is a popular location-based service ...
