Efficient Data Compression for 3D Sparse TPC via Bicephalous Convolutional Autoencoder

11/09/2021
by   Yi Huang, et al.
0

Real-time data collection and analysis in large experimental facilities present a great challenge across multiple domains, including high energy physics, nuclear physics, and cosmology. To address this, machine learning (ML)-based methods for real-time data compression have drawn significant attention. However, unlike natural image data, such as CIFAR and ImageNet that are relatively small-sized and continuous, scientific data often come in as three-dimensional data volumes at high rates with high sparsity (many zeros) and non-Gaussian value distribution. This makes direct application of popular ML compression methods, as well as conventional data compression methods, suboptimal. To address these obstacles, this work introduces a dual-head autoencoder to resolve sparsity and regression simultaneously, called Bicephalous Convolutional AutoEncoder (BCAE). This method shows advantages both in compression fidelity and ratio compared to traditional data compression methods, such as MGARD, SZ, and ZFP. To achieve similar fidelity, the best performer among the traditional methods can reach only half the compression ratio of BCAE. Moreover, a thorough ablation study of the BCAE method shows that a dedicated segmentation decoder improves the reconstruction.

READ FULL TEXT

page 1

page 2

page 3

research
02/28/2020

Improved Image Coding Autoencoder With Deep Learning

In this paper, we build autoencoder based pipelines for extreme end-to-e...
research
10/20/2022

Machine-Learning Compression for Particle Physics Discoveries

In collider-based particle and nuclear physics experiments, data are pro...
research
07/09/2023

Hierarchical Autoencoder-based Lossy Compression for Large-scale High-resolution Scientific Data

Lossy compression has become an important technique to reduce data size ...
research
05/03/2018

Polynomial data compression for large-scale physics experiments

The new generation research experiments will introduce huge data surge t...
research
04/25/2018

Deep Convolutional AutoEncoder-based Lossy Image Compression

Image compression has been investigated as a fundamental research topic ...
research
06/11/2020

Extreme data compression while searching for new physics

Bringing a high-dimensional dataset into science-ready shape is a formid...
research
01/10/2022

A Physics-Informed Vector Quantized Autoencoder for Data Compression of Turbulent Flow

Analyzing large-scale data from simulations of turbulent flows is memory...

Please sign up or login with your details

Forgot password? Click here to reset