Four-splitting based coarse-grained multicomputer parallel algorithm for the optimal binary search tree problem

07/13/2022
by Jerry Lacmou Zeutouo, et al.

This paper presents a parallel solution to the optimal binary search tree problem, based on the coarse-grained multicomputer (CGM) model and using the four-splitting technique. Knuth's well-known sequential algorithm solves this problem in 𝒪(n^2) time and space, where n is the number of keys used to build the optimal binary search tree. To parallelize this algorithm on the CGM model, the irregular partitioning technique, which subdivides the dependency graph into subgraphs (or blocks) of variable size, was previously proposed to address the trade-off between minimizing the number of communication rounds and balancing the load among processors. However, this technique induces high processor latency (which accounts for most of the global communication time), because with blocks of varying size, processors cannot start evaluating some blocks as soon as the data they need become available. The four-splitting technique proposed in this paper addresses this shortcoming by evaluating a block as a sequence of computation and communication steps over four subblocks. The resulting CGM-based parallel solution requires 𝒪(n^2/√(p)) execution time with 𝒪(k √(p)) communication rounds, where p is the number of processors and k is the number of times the size of blocks is subdivided. In an experimental study with 40960 keys on 128 processors and k = 2, the solution based on irregular partitioning achieves a speedup of up to ×10.39, whereas this solution achieves a speedup of up to ×13.12, rising to ×14.93 when k = 5.
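For readers unfamiliar with the sequential baseline mentioned above, the following is a minimal Python sketch of a Knuth-style 𝒪(n^2) dynamic program. It is an illustration under simplifying assumptions, not the paper's implementation: it considers only successful-search frequencies (no dummy keys for failed searches), and the function name `optimal_bst_cost` and its table names are hypothetical. The quadratic bound comes from restricting the candidate roots for keys i..j to the range between the optimal roots of i..j-1 and i+1..j.

```python
def optimal_bst_cost(freq):
    """Return (minimum weighted search cost, root table) for sorted keys.

    freq[i] is the access frequency of the i-th key (assumed input format).
    Only successful searches are modelled; this is a simplification.
    """
    n = len(freq)
    if n == 0:
        return 0, []

    # prefix[i] = sum of freq[0..i-1], so any range sum is O(1).
    prefix = [0] * (n + 1)
    for i, f in enumerate(freq):
        prefix[i + 1] = prefix[i] + f

    # cost[i][j]: optimal cost for keys i..j (inclusive).
    # root[i][j]: index of the root chosen for keys i..j.
    cost = [[0] * n for _ in range(n)]
    root = [[0] * n for _ in range(n)]
    for i in range(n):
        cost[i][i] = freq[i]
        root[i][i] = i

    # Fill the tables diagonal by diagonal (increasing subproblem length).
    for length in range(2, n + 1):
        for i in range(n - length + 1):
            j = i + length - 1
            weight = prefix[j + 1] - prefix[i]
            best, best_r = None, -1
            # Knuth's restriction: only roots between the optimal roots
            # of the two neighbouring subproblems need to be tried.
            for r in range(root[i][j - 1], root[i + 1][j] + 1):
                left = cost[i][r - 1] if r > i else 0
                right = cost[r + 1][j] if r < j else 0
                if best is None or left + right < best:
                    best, best_r = left + right, r
            cost[i][j] = best + weight
            root[i][j] = best_r

    return cost[0][n - 1], root
```

As a usage example, `optimal_bst_cost([34, 8, 50])` returns a cost of 142 with the third key (index 2) as the root. Each entry `cost[i][j]` depends on entries in the same row and column of shorter subproblems, which is the dependency structure that a block-based partitioning has to respect.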

