Node-Aware Improvements to Allreduce

10/21/2019
by   Amanda Bienz, et al.
0

The MPI_Allreduce collective operation is a core kernel of many parallel codebases, particularly for reductions over a single value per process. The commonly used allreduce recursive-doubling algorithm obtains the lower bound message count, yielding optimality for small reduction sizes based on node-agnostic performance models. However, this algorithm yields duplicate messages between sets of nodes. Node-aware optimizations in MPICH remove duplicate messages through use of a single master process per node, yielding a large number of inactive processes at each inter-node step. In this paper, we present an algorithm that uses the multiple processes available per node to reduce the maximum number of inter-node messages communicated by a single process, improving the performance of allreduce operations, particularly for small message sizes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/07/2022

A Locality-Aware Bruck Allgather

Collective algorithms are an essential part of MPI, allowing application...
research
01/20/2014

A Genetic Algorithm to Optimize a Tweet for Retweetability

Twitter is a popular microblogging platform. When users send out message...
research
04/25/2019

Genet: A Quickly Scalable Fat-Tree Overlay for Personal Volunteer Computing using WebRTC

WebRTC enables browsers to exchange data directly but the number of poss...
research
12/23/2016

Node Aware Sparse Matrix-Vector Multiplication

The sparse matrix-vector multiply (SpMV) operation is a key computationa...
research
04/11/2019

Reducing Communication in Algebraic Multigrid with Multi-step Node Aware Communication

Algebraic multigrid (AMG) is often viewed as a scalable 𝒪(n) solver for ...
research
08/26/2023

A Locality-Aware Sparse Dynamic Data Exchange

Parallel architectures are continually increasing in performance and sca...
research
03/09/2015

Kernel-Based Just-In-Time Learning for Passing Expectation Propagation Messages

We propose an efficient nonparametric strategy for learning a message op...

Please sign up or login with your details

Forgot password? Click here to reset