Image Gradient Decomposition for Parallel and Memory-Efficient Ptychographic Reconstruction

05/12/2022
by   Xiao Wang, et al.

Ptychography is a popular microscopic imaging modality behind many scientific discoveries, and it holds the record for the highest image resolution. Unfortunately, high-resolution ptychographic reconstruction requires a significant amount of memory and computation, forcing many applications to compromise their image resolution in exchange for a smaller memory footprint and a shorter reconstruction time. In this paper, we propose a novel image gradient decomposition method that significantly reduces the memory footprint of ptychographic reconstruction by tessellating image gradients and diffraction measurements into tiles. In addition, we propose a parallel image gradient decomposition method that enables asynchronous point-to-point communication and parallel pipelining with minimal overhead on a large number of GPUs. Our experiments on a titanate material dataset (PbTiO3) with 16632 probe locations show that our Gradient Decomposition algorithm reduces the memory footprint by 51 times. In addition, it achieves a time-to-solution within 2.2 minutes by scaling to 4158 GPUs with a super-linear speedup at 364% efficiency. This performance is 2.7 times more memory efficient, 9 times more scalable, and 86 times faster than the state-of-the-art algorithm.
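To make the idea of tessellating measurements into tiles concrete, the following is a minimal sketch and not the paper's algorithm. It assumes a rectangular image split into a regular grid of tiles and a square probe footprint; the function names (`tile_of`, `assign_probes`, `touched_tiles`) and all sizes are hypothetical. It shows how each probe location might be assigned to one owning tile, and how a footprint near a tile border overlaps neighboring tiles, which is where communication between tiles would be needed.

```python
# Illustrative sketch only: NOT the authors' method. The regular tile grid,
# the square probe footprint, and every name below are assumptions.

def tile_of(y, x, tile_h, tile_w):
    """Return (row, col) of the tile that contains pixel (y, x)."""
    return (y // tile_h, x // tile_w)


def assign_probes(probes, tile_h, tile_w):
    """Group probe indices by the tile that contains each probe center."""
    owners = {}
    for idx, (y, x) in enumerate(probes):
        owners.setdefault(tile_of(y, x, tile_h, tile_w), []).append(idx)
    return owners


def touched_tiles(y, x, half, tile_h, tile_w, img_h, img_w):
    """List the tiles overlapped by a square footprint of half-width `half`.

    The footprint is clipped to the image bounds before tiles are looked up.
    """
    y0, y1 = max(0, y - half), min(img_h - 1, y + half)
    x0, x1 = max(0, x - half), min(img_w - 1, x + half)
    r0, c0 = tile_of(y0, x0, tile_h, tile_w)
    r1, c1 = tile_of(y1, x1, tile_h, tile_w)
    return [(r, c) for r in range(r0, r1 + 1) for c in range(c0, c1 + 1)]


# Hypothetical 64x64 image split into four 32x32 tiles.
probes = [(10, 10), (10, 60), (60, 10), (31, 31)]
owners = assign_probes(probes, 32, 32)
interior = touched_tiles(10, 10, 4, 32, 32, 64, 64)
border = touched_tiles(31, 31, 4, 32, 32, 64, 64)
```

In this toy setup a probe well inside a tile touches only its own tile, while the probe at (31, 31) overlaps all four tiles, so its contribution to the image gradient would have to be exchanged with neighbors.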
