pPython for Parallel Python Programming

08/31/2022
by   Chansup Byun, et al.
0

pPython seeks to provide a parallel capability that provides good speed-up without sacrificing the ease of programming in Python by implementing partitioned global array semantics (PGAS) on top of a simple file-based messaging library (PythonMPI) in pure Python. The core data structure in pPython is a distributed numerical array whose distribution onto multiple processors is specified with a map construct. Communication operations between distributed arrays are abstracted away from the user and pPython transparently supports redistribution between any block-cyclic-overlapped distributions in up to four dimensions. pPython follows a SPMD (single program multiple data) model of computation. pPython runs on any combination of heterogeneous systems that support Python, including Windows, Linux, and MacOS operating systems. In addition to running transparently on single-node (e.g., a laptop), pPython provides a scheduler interface, so that pPython can be executed in a massively parallel computing environment. The initial implementation uses the Slurm scheduler. Performance of pPython on the HPC Challenge benchmark suite demonstrates both ease of programming and scalability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/07/2023

pPython Performance Study

pPython seeks to provide a parallel capability that provides good speed-...
research
04/26/2021

A PGAS Communication Library for Heterogeneous Clusters

This work presents a heterogeneous communication library for clusters of...
research
03/03/2019

Development details and computational benchmarking of DEPAM

In the big data era of observational oceanography, passive acoustics dat...
research
09/24/2016

Benchmarking SciDB Data Import on HPC Systems

SciDB is a scalable, computational database management system that uses ...
research
11/23/2015

A Python Extension for the Massively Parallel Multiphysics Simulation Framework waLBerla

We present a Python extension to the massively parallel HPC simulation t...
research
08/30/2022

The BioExcel methodology for developing dynamic, scalable, reliable and portable computational biomolecular workflows

Developing complex biomolecular workflows is not always straightforward....
research
03/22/2021

hep_tables: Heterogeneous Array Programming for HEP

Array operations are one of the most concise ways of expressing common f...

Please sign up or login with your details

Forgot password? Click here to reset