Parallel Discrete Convolutions on Adaptive Particle Representations of Images

12/07/2021
by   Joel Jonsson, et al.
2

We present data structures and algorithms for native implementations of discrete convolution operators over Adaptive Particle Representations (APR) of images on parallel computer architectures. The APR is a content-adaptive image representation that locally adapts the sampling resolution to the image signal. It has been developed as an alternative to pixel representations for large, sparse images as they typically occur in fluorescence microscopy. It has been shown to reduce the memory and runtime costs of storing, visualizing, and processing such images. This, however, requires that image processing natively operates on APRs, without intermediately reverting to pixels. Designing efficient and scalable APR-native image processing primitives, however, is complicated by the APR's irregular memory structure. Here, we provide the algorithmic building blocks required to efficiently and natively process APR images using a wide range of algorithms that can be formulated in terms of discrete convolutions. We show that APR convolution naturally leads to scale-adaptive algorithms that efficiently parallelize on multi-core CPU and GPU architectures. We quantify the speedups in comparison to pixel-based algorithms and convolutions on evenly sampled data. We achieve pixel-equivalent throughputs of up to 1 TB/s on a single Nvidia GeForce RTX 2080 gaming GPU, requiring up to two orders of magnitude less memory than a pixel-based implementation.

READ FULL TEXT

page 1

page 3

page 5

page 7

page 8

page 10

page 11

page 13

research
01/19/2018

Fully Point-wise Convolutional Neural Network for Modeling Statistical Regularities in Natural Images

Modeling statistical regularities is the problem of representing the pix...
research
01/02/2020

A Machine Learning Imaging Core using Separable FIR-IIR Filters

We propose fixed-function neural network hardware that is designed to pe...
research
10/23/2021

HWTool: Fully Automatic Mapping of an Extensible C++ Image Processing Language to Hardware

Implementing image processing algorithms using FPGAs or ASICs can improv...
research
10/11/2012

Computationally Efficient Implementation of Convolution-based Locally Adaptive Binarization Techniques

One of the most important steps of document image processing is binariza...
research
08/24/2023

Alternative quadrant representations with Morton index and AVX2 vectorization for AMR algorithms within the p4est software library

We present a technical enhancement within the p4est software for paralle...
research
04/13/2021

PAGANI: A Parallel Adaptive GPU Algorithm for Numerical

We present a new adaptive parallel algorithm for the challenging problem...
research
03/23/2022

Pixel VQ-VAEs for Improved Pixel Art Representation

Machine learning has had a great deal of success in image processing. Ho...

Please sign up or login with your details

Forgot password? Click here to reset