Histogram Sort with Sampling

03/03/2018
by   Vipul Harsh, et al.
0

To minimize data movement, state-of-the-art parallel sorting algorithms use sampling and histogramming techniques to partition keys prior to redistribution. Samples enable partitioning to be done using representative subset of the keys, while histogramming enables evaluation and iterative improvement of a given partitioning. We introduce Histogram sort with sampling (HSS), which combines sampling and histogramming techniques to find high-quality partitions with minimal data movement and high practical performance. Compared to the best known algorithm for finding this partitioning, our algorithm requires a factor of Θ(log(p)/ log log(p)) less communication than the best known (recently introduced) alternative, and substantially less when compared to standard variants of Sample sort and Histogram sort. We provide a distributed-memory implementation of the proposed algorithm and compare its performance to two existing implementations, and provide a brief application study showing the benefit of the new algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/10/2022

Optimal Round and Sample-Size Complexity for Partitioning in Parallel Sorting

State-of-the-art parallel sorting algorithms for distributed-memory arch...
research
09/29/2018

On-line partitioning of width w posets into w^O(log log w) chains

An on-line chain partitioning algorithm receives the vertices of a poset...
research
08/01/2020

Data Oblivious Algorithms for Multicores

As secure processors such as Intel SGX (with hyperthreading) become wide...
research
10/26/2018

Some comments on the structure of the best known networks sorting 16 elements

We propose an explanation of the structure of the best known sorting net...
research
10/03/2011

Distributed Evolutionary Graph Partitioning

We present a novel distributed evolutionary algorithm, KaFFPaE, to solve...
research
04/24/2017

A Novel Hybrid Quicksort Algorithm Vectorized using AVX-512 on Intel Skylake

The modern CPU's design, which is composed of hierarchical memory and SI...
research
05/12/2022

Vectorized and performance-portable Quicksort

Recent works showed that implementations of Quicksort using vector CPU i...

Please sign up or login with your details

Forgot password? Click here to reset