Algorithms for a Topology-aware Massively Parallel Computation Model

09/24/2020
by Xiao Hu, et al.

Most prior work in massively parallel data processing assumes homogeneity, i.e., every computing unit has the same computational capability and can communicate with every other unit with the same latency and bandwidth. However, this strong assumption of a uniform topology rarely holds in practical settings, where computing units are connected through complex networks. To address this issue, Blanas et al. recently proposed a topology-aware massively parallel computation model that incorporates network structure and heterogeneity into the cost model. The network is modeled as a directed graph, where each edge is associated with a cost function that depends on the data transferred between its two endpoints. The computation proceeds in synchronous rounds, and the cost of each round is the maximum cost over all edges in the network. In this work, we take a first step toward investigating three fundamental data processing tasks in this topology-aware parallel model: set intersection, Cartesian product, and sorting. We focus on tree topologies and present lower bounds together with (asymptotically) matching upper bounds. The optimality of our algorithms is with respect to the initial data distribution among the network nodes, rather than a worst-case distribution as assumed in previous results. Beyond their theoretical optimality, our protocols are simple, use a constant number of rounds, and, we believe, can be implemented in practical settings as well.
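To make the cost measure concrete, the following is a minimal Python sketch of how the cost of one round could be evaluated on a tree topology. It is an illustration under stated assumptions, not the authors' protocol: the abstract says only that each edge has a cost function depending on the data transferred, so the linear cost (load divided by bandwidth) used here is an assumption, and the names `tree_path`, `round_cost`, `adj`, `bandwidth`, and `transfers` are hypothetical.

```python
from collections import defaultdict, deque


def tree_path(adj, src, dst):
    """Return the unique src -> dst path in a tree as a list of nodes (BFS)."""
    parent = {src: None}
    queue = deque([src])
    while queue:
        u = queue.popleft()
        if u == dst:
            break
        for v in adj[u]:
            if v not in parent:
                parent[v] = u
                queue.append(v)
    path = []
    node = dst
    while node is not None:
        path.append(node)
        node = parent[node]
    return path[::-1]


def round_cost(adj, bandwidth, transfers):
    """Cost of one synchronous round: the maximum edge cost over all edges.

    ASSUMPTION: each directed edge e has the linear cost load(e) / bandwidth[e].
    The model itself only requires some cost function of the data transferred.
    """
    load = defaultdict(int)
    for src, dst, size in transfers:
        path = tree_path(adj, src, dst)
        # Every directed edge on the unique tree path carries this message.
        for u, v in zip(path, path[1:]):
            load[(u, v)] += size
    return max((load[e] / bandwidth[e] for e in load), default=0.0)


# Hypothetical star-shaped tree: router "r" connected to compute nodes a, b, c.
adj = {"r": ["a", "b", "c"], "a": ["r"], "b": ["r"], "c": ["r"]}
bandwidth = {
    ("a", "r"): 1, ("r", "a"): 1,
    ("b", "r"): 2, ("r", "b"): 2,
    ("c", "r"): 4, ("r", "c"): 4,
}
# (source, destination, data size) for the messages sent in one round.
transfers = [("a", "b", 4), ("c", "b", 8)]
cost = round_cost(adj, bandwidth, transfers)
```

In this example the edge from `r` to `b` carries both messages, so it is the bottleneck that determines the cost of the round. Because the round cost is a maximum over edges rather than a sum, it reflects the most congested link rather than the total communication.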


research
03/12/2020

Topology Dependent Bounds For FAQs

In this paper, we prove topology dependent bounds on the number of round...
research
03/15/2018

CIM/E Oriented Graph Database Model Architecture and Parallel Network Topology Processing

CIM/E is an easy and efficient electric power model exchange standard be...
research
02/25/2023

The Hardness of Optimization Problems on the Weighted Massively Parallel Computation Model

The topology-aware Massively Parallel Computation (MPC) model is propose...
research
04/10/2022

Optimal Round and Sample-Size Complexity for Partitioning in Parallel Sorting

State-of-the-art parallel sorting algorithms for distributed-memory arch...
research
07/23/2018

Random Sampling Applied to the MST Problem in the Node Congested Clique Model

The Congested Clique model, proposed by Lotker et al. [SPAA'03, SICOMP'0...
research
10/16/2018

Always be Two Steps Ahead of Your Enemy

We investigate the maintenance of overlay networks under massive churn, ...
research
09/28/2020

Models of Smoothing in Dynamic Networks

Smoothed analysis is a framework suggested for mediating gaps between wo...
