Multivariate Pointwise Information-Driven Data Sampling and Visualization

07/26/2019
by   Soumya Dutta, et al.
0

With increasing computing capabilities of modern supercomputers, the size of the data generated from the scientific simulations is growing rapidly. As a result, application scientists need effective data summarization techniques that can reduce large-scale multivariate spatiotemporal data sets while preserving the important data properties so that the reduced data can answer domain-specific queries involving multiple variables with sufficient accuracy. While analyzing complex scientific events, domain experts often analyze and visualize two or more variables together to obtain a better understanding of the characteristics of the data features. Therefore, data summarization techniques are required to analyze multi-variable relationships in detail and then perform data reduction such that the important features involving multiple variables are preserved in the reduced data. To achieve this, in this work, we propose a data sub-sampling algorithm for performing statistical data summarization that leverages pointwise information theoretic measures to quantify the statistical association of data points considering multiple variables and generates a sub-sampled data that preserves the statistical association among multi-variables. Using such reduced sampled data, we show that multivariate feature query and analysis can be done effectively. The efficacy of the proposed multivariate association driven sampling algorithm is presented by applying it on several scientific data sets.

READ FULL TEXT

page 5

page 11

page 15

page 19

page 20

page 21

page 24

page 26

research
08/31/2020

Relationship-aware Multivariate Sampling Strategy for Scientific Simulation Data

With the increasing computational power of current supercomputers, the s...
research
08/19/2019

A Co-analysis Framework for Exploring Multivariate Scientific Data

In complex multivariate data sets, different features usually include di...
research
08/18/2019

Multivariate Spatial Data Visualization: A Survey

Multivariate spatial data plays an important role in computational scien...
research
07/28/2023

Multivariate Differential Association Analysis

Identifying how dependence relationships vary across different condition...
research
09/10/2021

Unsupervised classification of simulated magnetospheric regions

In magnetospheric missions, burst mode data sampling should be triggered...
research
09/11/2020

Visual Neural Decomposition to Explain Multivariate Data Sets

Investigating relationships between variables in multi-dimensional data ...
research
03/29/2018

The use of fourth order cumulant tensors to detect outlier features modelled by a t-Student copula

In this paper we use multivariate cumulant of order 4 to distinguish bet...

Please sign up or login with your details

Forgot password? Click here to reset