Analyzing the HCP Datasets using GPUs: The Anatomy of a Science Engagement

09/07/2019
by   John-Paul Robinson, et al.
0

This paper documents the experience improving the performance of a data processing workflow for analysis of the Human Connectome Project's HCP900 data set. It describes how network and compute bottlenecks were discovered and resolved during the course of a science engagement. A series of computational enhancements to the stock FSL BedpostX workflow are described. These enhancements migrated the workflow from a slow serial execution of computations resulting from Slurm scheduler incompatibilities to eventual execution on GPU resources, going from a 21-day execution on a single CPU core to a 2 hour execution on a GPU. This workflow contributed a vital use-case to the build-out of the campus compute cluster with additional GPUs and resulted in enhancements to network bandwidth. It also shares insights on potential improvements to distribution of scientific software to avoid stagnation in site-specific deployment decisions. The discussion highlights the advantages of open licenses and popular code collaboration sites like GitHub.com in feeding contributions upstream.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/04/2020

StreamFlow: cross-breeding cloud with HPC

Workflows are among the most commonly used tools in a variety of executi...
research
11/28/2022

Adding Workflow Management Flexibility to LSST Pipelines Execution

Data processing pipelines need to be executed at scales ranging from sma...
research
06/04/2020

Portability of Scientific Workflows in NGS Data Analysis: A Case Study

The analysis of next-generation sequencing (NGS) data requires complex c...
research
05/18/2022

The anachronism of whole-GPU accounting

NVIDIA has been making steady progress in increasing the compute perform...
research
12/18/2019

Scheduling Algorithms for Efficient Execution of Stream Workflow Applications in Multicloud Environments

Big data processing applications are becoming more and more complex. The...
research
03/06/2022

Managing Complex Workflows in Bioinformatics - An Interactive Toolkit with GPU Acceleration

Bioinformatics research continues to advance at an increasing scale with...
research
07/08/2021

Expanding IceCube GPU computing into the Clouds

The IceCube collaboration relies on GPU compute for many of its needs, i...

Please sign up or login with your details

Forgot password? Click here to reset