Fast Implementation of Morphological Filtering Using ARM NEON Extension

02/19/2020
by   Elena Limonova, et al.
0

In this paper we consider speedup potential of morphological image filtering on ARM processors. Morphological operations are widely used in image analysis and recognition and their speedup in some cases can significantly reduce overall execution time of recognition. More specifically, we propose fast implementation of erosion and dilation using ARM SIMD extension NEON. These operations with the rectangular structuring element are separable. They were implemented using the advantages of separability as sequential horizontal and vertical passes. Each pass was implemented using van Herk/Gil-Werman algorithm for large windows and low-constant linear complexity algorithm for small windows. Final implementation was improved with SIMD and used a combination of these methods. We also considered fast transpose implementation of 8x8 and 16x16 matrices using ARM NEON to get additional computational gain for morphological operations. Experiments showed 3 times efficiency increase for final implementation of erosion and dilation compared to van Herk/Gil-Werman algorithm without SIMD, 5.7 times speedup for 8x8 matrix transpose and 12 times speedup for 16x16 matrix transpose compared to transpose without SIMD.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/17/2021

A fast vectorized sorting implementation based on the ARM scalable vector extension (SVE)

The way developers implement their algorithms and how these implementati...
research
05/18/2022

Fast matrix multiplication for binary and ternary CNNs on ARM CPU

Low-bit quantized neural networks are of great interest in practical app...
research
12/16/2020

Parallel Implementation of Distributed Global Optimization (DGO)

Parallel implementations of distributed global optimization (DGO) [13] o...
research
03/03/2022

ARM 4-BIT PQ: SIMD-based Acceleration for Approximate Nearest Neighbor Search on ARM

We accelerate the 4-bit product quantization (PQ) on the ARM architectur...
research
09/04/2019

Deep Morphological Neural Networks

Mathematical morphology is a theory and technique to collect features li...
research
09/21/2019

Multithreaded Filtering Preconditioner for Diffusion Equation on Structured Grid

A parallel and nested version of a frequency filtering preconditioner is...
research
07/21/2018

Fast Matrix Inversion and Determinant Computation for Polarimetric Synthetic Aperture Radar

This paper introduces a fast algorithm for simultaneous inversion and de...

Please sign up or login with your details

Forgot password? Click here to reset