A Parallel Data Compression Framework for Large Scale 3D Scientific Data

03/18/2019
by   Panagiotis Hadjidoukas, et al.
0

Large scale simulations of complex systems ranging from climate and astrophysics to crowd dynamics, produce routinely petabytes of data and are projected to reach the zettabytes level in the coming decade. These simulations enable unprecedented insights but at the same their effectiveness is hindered by the enormous data sizes associated with the computational elements and respective output quantities of interest that impose severe constraints on storage and I/O time. In this work, we address these challenges through a novel software framework for scientific data compression. The software (CubismZ) incorporates efficient wavelet based techniques and the state-of-the-art ZFP, SZ and FPZIP floating point compressors. The framework relies on a block-structured data layout, benefits from OpenMP and MPI and targets supercomputers based on multicores. CubismZ can be used as a tool for ex situ (offline) compression of scientific datasets and supports conventional Computational Fluid Dynamics (CFD) file formats. Moreover, it provides a testbed of comparison, in terms of compression factor and peak signal-to-noise ratio, for a number of available data compression methods. The software yields in situ compression ratios of 100x or higher for fluid dynamics data produced by petascale simulations of cloud cavitation collapse using O(10^11) grid cells, with negligible impact on the total simulation time.

READ FULL TEXT
research
01/17/2020

FRaZ: A Generic High-Fidelity Fixed-Ratio Lossy Compression Framework for Scientific Floating-point Data

With ever-increasing volumes of scientific floating-point data being pro...
research
10/11/2018

Data Compression for Environmental Flow Simulations

A wavelet-based method for compression of fluid flow simulation data is ...
research
03/02/2021

Task-parallel in-situ temporal compression of large-scale computational fluid dynamics data

Present day computational fluid dynamics simulations generate extremely ...
research
11/10/2017

In-Depth Exploration of Single-Snapshot Lossy Compression Techniques for N-Body Simulations

In situ lossy compression allowing user-controlled data loss can signifi...
research
07/09/2023

Hierarchical Autoencoder-based Lossy Compression for Large-scale High-resolution Scientific Data

Lossy compression has become an important technique to reduce data size ...
research
12/21/2022

Scalable Hybrid Learning Techniques for Scientific Data Compression

Data compression is becoming critical for storing scientific data becaus...
research
11/27/2021

Exploring Lossy Compressibility through Statistical Correlations of Scientific Datasets

Lossy compression plays a growing role in scientific simulations where t...

Please sign up or login with your details

Forgot password? Click here to reset