Aggregating over Dominated Points by Sorting, Scanning, Zip and Flat Maps

05/26/2023
by   Jacek Sroka, et al.
0

Prefix aggregation operation (also called scan), and its particular case, prefix summation, is an important parallel primitive and enjoys a lot of attention in the research literature. It is also used in many algorithms as one of the steps. Aggregation over dominated points in ℝ^m is a multidimensional generalisation of prefix aggregation. It is also intensively researched, both as a parallel primitive and as a practical problem, encountered in computational geometry, spatial databases and data warehouses. In this paper we show that, for a constant dimension m, aggregation over dominated points in ℝ^m can be computed by O(1) basic operations that include sorting the whole dataset, zipping sorted lists of elements, computing prefix aggregations of lists of elements and flat maps, which expand the data size from initial n to nlog^m-1n. Thereby we establish that prefix aggregation suffices to express aggregation over dominated points in more dimensions, even though the latter is a far-reaching generalisation of the former. Many problems known to be expressible by aggregation over dominated points become expressible by prefix aggregation, too. We rely on a small set of primitive operations which guarantee an easy transfer to various distributed architectures and some desired properties of the implementation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/17/2018

Merge Non-Dominated Sorting Algorithm for Many-Objective Optimization

Many Pareto-based multi-objective evolutionary algorithms require to ran...
research
10/16/2014

Multivariate Spearman's rho for aggregating ranks using copulas

We study the problem of rank aggregation: given a set of ranked lists, w...
research
03/25/2022

Rank-based Non-dominated Sorting

Non-dominated sorting is a computational bottleneck in Pareto-based mult...
research
04/14/2018

On Asynchronous Non-Dominated Sorting for Steady-State Multiobjective Evolutionary Algorithms

In parallel and distributed environments, generational evolutionary algo...
research
11/22/2021

Functorial aggregation

Aggregating data in a database could also be called "integrating along f...
research
10/01/2020

Sort-based grouping and aggregation

Database query processing requires algorithms for duplicate removal, gro...
research
10/01/2018

Chasing Similarity: Distribution-aware Aggregation Scheduling (Extended Version)

Parallel aggregation is a ubiquitous operation in data analytics that is...

Please sign up or login with your details

Forgot password? Click here to reset