Onesweep: A Faster Least Significant Digit Radix Sort for GPUs

06/03/2022
by   Andy Adinets, et al.
0

We present Onesweep, a least-significant digit (LSD) radix sorting algorithm for large GPU sorting problems residing in global memory. Our parallel algorithm employs a method of single-pass prefix sum that only requires  2n global read/write operations for each digit-binning iteration. This exhibits a significant reduction in last-level memory traffic versus contemporary GPU radix sorting implementations, where each iteration of digit binning requires two passes through the dataset totaling  3n global memory operations. On the NVIDIA A100 GPU, our approach achieves 29.4 GKey/s when sorting 256M random 32-bit keys. Compared to CUB, the current state-of-the-art GPU LSD radix sort, our approach provides a speedup of  1.5x. For 32-bit keys with varied distributions, our approach provides more consistent performance compared to HRS, the current state-of-the-art GPU MSD radix sort, and outperforms it in almost all cases.

READ FULL TEXT
research
05/11/2018

An O(N) Sorting Algorithm: Machine Learning Sorting

We propose an O(N) sorting algorithm based on Machine Learning method, w...
research
06/11/2021

Bandwidth-Optimal Random Shuffling for GPUs

Linear-time algorithms that are traditionally used to shuffle data on CP...
research
09/19/2023

Fast and reconfigurable sort-in-memory system enabled by memristors

Sorting is fundamental and ubiquitous in modern computing systems. Hardw...
research
07/05/2021

Defeating duplicates: A re-design of the LearnedSort algorithm

LearnedSort is a novel sorting algorithm that, unlike traditional method...
research
12/17/2020

Sorting in Memristive Memory

Sorting is needed in many application domains. The data is read from mem...
research
01/26/2015

GPU Programming - Speeding Up the 3D Surface Generator VESTA

The novel "Volume-Enclosing Surface exTraction Algorithm" (VESTA) genera...
research
07/20/2019

NNS: The Case For Neural Network-based Sorting

CPU-SIMD/GPU/TPUs will be increasingly powerful. The algorithm using neu...

Please sign up or login with your details

Forgot password? Click here to reset