Deploying AI Frameworks on Secure HPC Systems with Containers

05/24/2019
by David Brayford, et al.

The growing interest of the research community and industry in using Artificial Intelligence (AI) techniques to tackle "real world" problems requires High Performance Computing (HPC) resources to efficiently compute and scale complex algorithms across thousands of nodes. Unfortunately, typical data scientists are not familiar with the unique requirements and characteristics of HPC environments. They usually develop their applications with high-level scripting languages or frameworks such as TensorFlow, and the installation process often requires a connection to external systems to download open source software during the build. HPC environments, on the other hand, are often based on closed source applications that incorporate parallel and distributed computing APIs such as MPI and OpenMP, while users have restricted administrator privileges and face security restrictions such as no access to external systems. In this paper we discuss the issues associated with deploying AI frameworks in a secure HPC environment and how we successfully deploy AI frameworks on SuperMUC-NG with Charliecloud.

