LearnedSort as a learning-augmented SampleSort: Analysis and Parallelization

07/17/2023
by Ivan Carvalho, et al.

This work analyzes and parallelizes LearnedSort, a novel algorithm that sorts using machine learning models of the cumulative distribution function of the data. LearnedSort is analyzed through the lens of algorithms with predictions, and it is argued that LearnedSort is a learning-augmented SampleSort. A parallel LearnedSort algorithm is developed by combining LearnedSort with IPS4o, the state-of-the-art SampleSort implementation. Benchmarks on synthetic and real-world datasets show that parallel LearnedSort achieves better parallel performance than IPS4o and other sorting algorithms.
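To make the connection between CDF-based sorting and SampleSort concrete, the following is a minimal sequential sketch, not the authors' implementation. It assumes an empirical CDF built from a sorted random sample as a stand-in for a trained model; the function name `learned_bucket_sort` and the parameters `num_buckets`, `sample_size`, and `seed` are hypothetical and chosen only for illustration. Each key is routed to a bucket by its predicted CDF value, and because the prediction is monotone, sorting each bucket and concatenating yields a sorted output.

```python
import bisect
import random


def learned_bucket_sort(data, num_buckets=16, sample_size=256, seed=0):
    """Sort keys by routing them to buckets with an estimated CDF.

    Illustrative only: the "model" is an empirical CDF over a sorted
    sample (an assumption of this sketch, not a trained model).
    """
    data = list(data)
    if len(data) <= 1:
        return data

    rng = random.Random(seed)
    # "Training": a sorted random sample serves as the CDF estimate.
    sample = sorted(rng.sample(data, min(sample_size, len(data))))

    def predicted_cdf(x):
        # Fraction of sampled keys <= x; monotone non-decreasing in x.
        return bisect.bisect_right(sample, x) / len(sample)

    buckets = [[] for _ in range(num_buckets)]
    for x in data:
        # Monotonicity guarantees every key in bucket i is <= every key
        # in bucket i + 1, so buckets can be sorted independently.
        idx = min(int(predicted_cdf(x) * num_buckets), num_buckets - 1)
        buckets[idx].append(x)

    out = []
    for bucket in buckets:
        out.extend(sorted(bucket))  # base-case sort within each bucket
    return out
```

In this sketch the sorted sample plays the role that splitters play in SampleSort, which is the sense in which a CDF-driven sort can be read as a learning-augmented SampleSort. Since the buckets are independent once keys are partitioned, they are also the natural unit of parallel work.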


