RADE: Resource-Efficient Supervised Anomaly Detection Using Decision Tree-Based Ensemble Methods

09/26/2019
by Shay Vargaftik, et al.

Decision-tree-based ensemble classification methods (DTEMs) are a prevalent tool for supervised anomaly detection. However, as datasets continue to grow, DTEMs suffer from growing memory footprints, longer training times, and higher classification latency with lower throughput. In this paper, we present and evaluate RADE, a DTEM-based anomaly detection framework that augments standard DTEM classifiers and alleviates these drawbacks by relying on two observations. (1) A small (coarse-grained) DTEM model is sufficient to correctly classify the majority of classification queries, where a classification is considered valid only if its confidence level is greater than or equal to a predetermined classification confidence threshold. (2) In the fewer, harder cases where the coarse-grained model has insufficient confidence in its classification, the result can be improved by forwarding the query to one of the expert (fine-grained) DTEM models, which is explicitly trained for that particular case. We implement RADE in Python on top of scikit-learn and evaluate it with five DTEM methods (RF, XGBoost, AdaBoost, GBDT, and LightGBM) on three publicly available datasets. Our evaluation on both a strong AWS EC2 instance and a Raspberry Pi 3 device indicates that RADE offers competitive and often superior anomaly detection capabilities compared to standard DTEM methods, while significantly improving memory footprint (by up to 5.46x), training time (by up to 17.2x), and classification latency (by up to 31.2x).
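The two observations amount to a confidence-gated routing rule, which the following minimal sketch illustrates. It is not the authors' implementation: the function name `classify_with_routing`, the toy stub models, and the threshold value are all hypothetical. In particular, choosing the expert by the coarse model's predicted label is an assumption made for illustration; the abstract only says the query goes to an expert trained "for that particular case". The models are plain callables returning class probabilities, so the sketch runs without any library; in practice they could wrap the `predict_proba` method of scikit-learn-style classifiers.

```python
from typing import Callable, Dict, List, Sequence, Tuple

# A "model" is any callable mapping a feature vector to class probabilities.
# In practice this could wrap a fitted classifier's predict_proba (assumption).
ProbaFn = Callable[[Sequence[float]], List[float]]


def argmax(values: Sequence[float]) -> int:
    """Index of the largest value (first one wins on ties)."""
    return max(range(len(values)), key=lambda i: values[i])


def classify_with_routing(
    x: Sequence[float],
    coarse: ProbaFn,
    experts: Dict[int, ProbaFn],
    threshold: float,
) -> Tuple[int, float, str]:
    """Return (label, confidence, source) for a single query.

    The coarse model's answer is accepted when its confidence is at least
    `threshold`; otherwise the query is forwarded to an expert.
    ASSUMPTION: the expert is selected by the coarse model's predicted label.
    """
    probs = coarse(x)
    label = argmax(probs)
    confidence = probs[label]
    if confidence >= threshold:
        return label, confidence, "coarse"

    # Low-confidence case: let the matching expert decide.
    expert_probs = experts[label](x)
    expert_label = argmax(expert_probs)
    return expert_label, expert_probs[expert_label], f"expert[{label}]"


# --- Hypothetical toy models (class 0 = normal, class 1 = anomaly) ---------

def coarse_stub(x: Sequence[float]) -> List[float]:
    # Treats the first feature, clipped to [0, 1], as the anomaly probability.
    p = min(max(x[0], 0.0), 1.0)
    return [1.0 - p, p]


def expert_for_normal(x: Sequence[float]) -> List[float]:
    # Handles queries the coarse model weakly labeled as normal.
    p = 0.9 if x[1] > 0.5 else 0.1
    return [1.0 - p, p]


def expert_for_anomaly(x: Sequence[float]) -> List[float]:
    # Handles queries the coarse model weakly labeled as anomalous.
    p = 0.8 if x[1] > 0.2 else 0.2
    return [1.0 - p, p]


EXPERTS = {0: expert_for_normal, 1: expert_for_anomaly}

if __name__ == "__main__":
    for query in ([0.05, 0.0], [0.6, 0.9], [0.4, 0.9]):
        print(query, classify_with_routing(query, coarse_stub, EXPERTS, 0.9))
```

Because most queries are expected to clear the threshold, only the small coarse model is consulted in the common case; the larger expert models are evaluated only for the low-confidence remainder.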
