A cost effective and reliable environment monitoring system for HPC applications

01/29/2018
by   Peter Bernd Otte, et al.
0

We present a slow control system to gather all relevant environment information necessary to effectively and reliably run an HPC (High Performance Computing) system at a high value over price ratio. The scalable and reliable overall concept is presented as well as a newly developed hardware device for sensor read out. This device incorporates a Raspberry Pi, an Arduino and PoE (Power over Ethernet) functionality in a compact form factor. The system is in use at the 2 PFLOPS cluster of the Johannes Gutenberg-University and Helmholtz-Institute in Mainz.

READ FULL TEXT

page 3

page 5

research
06/13/2018

The importance and need for system monitoring and analysis in HPC operations and research

In this work, system monitoring and analysis are discussed in terms of t...
research
07/11/2018

Eurolab-4-HPC Long-Term Vision on High-Performance Computing

Radical changes in computing are foreseen for the next decade. The US IE...
research
06/18/2016

Scalability of VM Provisioning Systems

Virtual machines and virtualized hardware have been around for over half...
research
07/15/2023

PSI/J: A Portable Interface for Submitting, Monitoring, and Managing Jobs

It is generally desirable for high-performance computing (HPC) applicati...
research
06/07/2018

Dwarf in a Giant: Enabling Scalable, High-Resolution HPC Energy Monitoring for Real-Time Profiling and Analytics

Energy efficiency, predictive maintenance and security are today key cha...
research
09/28/2021

A Look at Communication-Intensive Performance in Julia

The Julia programming language continues to gain popularity both for its...
research
03/30/2020

Building a Shared Resource HPC Center Across University Schools and Institutes: A Case Study

Over the past several years, The George Washington University has recrui...

Please sign up or login with your details

Forgot password? Click here to reset