ROIBIN-SZ: Fast and Science-Preserving Compression for Serial Crystallography

06/22/2022
by   Robert Underwood, et al.
0

Crystallography is the leading technique to study atomic structures of proteins and produces enormous volumes of information that can place strains on the storage and data transfer capabilities of synchrotron and free-electron laser light sources. Lossy compression has been identified as a possible means to cope with the growing data volumes; however, prior approaches have not produced sufficient quality at a sufficient rate to meet scientific needs. This paper presents Region Of Interest BINning with SZ lossy compression (ROIBIN-SZ) a novel, parallel, and accelerated compression scheme that separates the dynamically selected preservation of key regions with lossy compression of background information. We perform and present an extensive evaluation of the performance and quality results made by the co-design of this compression scheme. We can achieve up to a 196x and 46.44x compression ratio on lysozyme and selenobiotinyl-streptavidin while preserving the data sufficiently to reconstruct the structure at bandwidths and scales that approach the needs of the upcoming light sources

READ FULL TEXT

page 3

page 9

page 10

page 11

research
07/11/2023

Optimizing Scientific Data Transfer on Globus with Error-bounded Lossy Compression

The increasing volume and velocity of science data necessitate the frequ...
research
06/19/2023

Parallel Data Compression Techniques

With endless amounts of data and very limited bandwidth, fast data compr...
research
05/17/2018

Fixed-PSNR Lossy Compression for Scientific Data

Error-controlled lossy compression has been studied for years because of...
research
02/21/2018

Lossless Compression of Angiogram Foreground with Visual Quality Preservation of Background

By increasing the volume of telemedicine information, the need for medic...
research
06/23/2018

Optimizing Lossy Compression Rate-Distortion from Automatic Online Selection between SZ and ZFP

With ever-increasing volumes of scientific data produced by HPC applicat...
research
01/24/2022

AutoMC: Automated Model Compression based on Domain Knowledge and Progressive search strategy

Model compression methods can reduce model complexity on the premise of ...
research
09/27/2022

Managed Network Services for Exascale Data Movement Across Large Global Scientific Collaborations

Unique scientific instruments designed and operated by large global coll...

Please sign up or login with your details

Forgot password? Click here to reset