Anomaly Detection using Autoencoders in High Performance Computing Systems

11/13/2018
by   Andrea Borghesi, et al.
0

Anomaly detection in supercomputers is a very difficult problem due to the big scale of the systems and the high number of components. The current state of the art for automated anomaly detection employs Machine Learning methods or statistical regression models in a supervised fashion, meaning that the detection tool is trained to distinguish among a fixed set of behaviour classes (healthy and unhealthy states). We propose a novel approach for anomaly detection in High Performance Computing systems based on a Machine (Deep) Learning technique, namely a type of neural network called autoencoder. The key idea is to train a set of autoencoders to learn the normal (healthy) behaviour of the supercomputer nodes and, after training, use them to identify abnormal conditions. This is different from previous approaches which where based on learning the abnormal condition, for which there are much smaller datasets (since it is very hard to identify them to begin with). We test our approach on a real supercomputer equipped with a fine-grained, scalable monitoring infrastructure that can provide large amount of data to characterize the system behaviour. The results are extremely promising: after the training phase to learn the normal system behaviour, our method is capable of detecting anomalies that have never been seen before with a very good accuracy (values ranging between 88

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/14/2023

Masked Autoencoders for Unsupervised Anomaly Detection in Medical Images

Pathological anomalies exhibit diverse appearances in medical imaging, m...
research
09/14/2020

DAICS: A Deep Learning Solution for Anomaly Detection in Industrial Control Systems

Deep Learning is emerging as an effective technique to detect sophistica...
research
02/22/2019

Online Anomaly Detection in HPC Systems

Reliability is a cumbersome problem in High Performance Computing System...
research
10/02/2018

Landmine Detection Using Autoencoders on Multi-polarization GPR Volumetric Data

Buried landmines and unexploded remnants of war are a constant threat fo...
research
04/22/2021

Unsupervised anomaly detection for a Smart Autonomous Robotic Assistant Surgeon (SARAS)using a deep residual autoencoder

Anomaly detection in Minimally-Invasive Surgery (MIS) traditionally requ...
research
01/27/2021

Autoencoder-based Condition Monitoring and Anomaly Detection Method for Rotating Machines

Rotating machines like engines, pumps, or turbines are ubiquitous in mod...
research
03/29/2022

Radial Autoencoders for Enhanced Anomaly Detection

In classification problems, supervised machine-learning methods outperfo...

Please sign up or login with your details

Forgot password? Click here to reset