Visual Analysis of Large Multivariate Scattered Data using Clustering and Probabilistic Summaries

08/21/2020
by   Tobias Rapp, et al.
0

Rapidly growing data sizes of scientific simulations pose significant challenges for interactive visualization and analysis techniques. In this work, we propose a compact probabilistic representation to interactively visualize large scattered datasets. In contrast to previous approaches that represent blocks of volumetric data using probability distributions, we model clusters of arbitrarily structured multivariate data. In detail, we discuss how to efficiently represent and store a high-dimensional distribution for each cluster. We observe that it suffices to consider low-dimensional marginal distributions for two or three data dimensions at a time to employ common visual analysis techniques. Based on this observation, we represent high-dimensional distributions by combinations of low-dimensional Gaussian mixture models. We discuss the application of common interactive visual analysis techniques to this representation. In particular, we investigate several frequency-based views, such as density plots in 1D and 2D, density-based parallel coordinates, and a time histogram. We visualize the uncertainty introduced by the representation, discuss a level-of-detail mechanism, and explicitly visualize outliers. Furthermore, we propose a spatial visualization by splatting anisotropic 3D Gaussians for which we derive a closed-form solution. Lastly, we describe the application of brushing and linking to this clustered representation. Our evaluation on several large, real-world datasets demonstrates the scaling of our approach.

READ FULL TEXT

page 1

page 6

page 7

page 9

research
07/22/2022

Fiber Uncertainty Visualization for Bivariate Data With Parametric and Nonparametric Noise Models

Visualization and analysis of multivariate data and their uncertainty ar...
research
07/19/2022

AccuStripes: Adaptive Binning for the Visual Comparison of Univariate Data Distributions

Understanding and comparing distributions of data (e.g., regarding their...
research
08/09/2014

Warped Mixtures for Nonparametric Cluster Shapes

A mixture of Gaussians fit to a single curved or heavy-tailed cluster wi...
research
04/03/2023

Gaussian model for closed curves

Gaussian Mixture Models (GMM) do not adapt well to curved and strongly n...
research
07/05/2019

Visualization of Emergency Department Clinical Data for Interpretable Patient Phenotyping

Visual summarization of clinical data collected on patients contained wi...
research
02/19/2014

Analysis of Multibeam SONAR Data using Dissimilarity Representations

This paper considers the problem of low-dimensional visualisation of ver...
research
12/24/2012

Reconstructing Self Organizing Maps as Spider Graphs for better visual interpretation of large unstructured datasets

Self-Organizing Maps (SOM) are popular unsupervised artificial neural ne...

Please sign up or login with your details

Forgot password? Click here to reset