RAD: On-line Anomaly Detection for Highly Unreliable Data

11/11/2019
by Zilong Zhao, et al.

Classification algorithms have been widely adopted to detect anomalies in various systems, e.g., IoT, cloud, and face recognition, under the common assumption that the data source is clean, i.e., that features and labels are correctly set. However, data collected from the wild can be unreliable due to careless annotation or malicious data transformation intended to cause incorrect anomaly detection. In this paper, we present a two-layer on-line learning framework for robust anomaly detection (RAD) in the presence of unreliable anomaly labels, where the first layer filters out suspicious data and the second layer detects anomaly patterns from the remaining data. To adapt to the on-line nature of anomaly detection, we extend RAD with three additional features: repeated cleaning, conflicting opinions of classifiers, and oracle knowledge. We learn on-line from the incoming data streams and continuously cleanse the data, so as to adapt to the increasing learning capacity that comes with a larger accumulated data set. Moreover, we explore the concept of oracle learning, which provides additional information on the true labels of difficult data points. We focus on three use cases: (i) detecting 10 classes of IoT attacks, (ii) predicting 4 classes of task failures of big data jobs, and (iii) recognising the faces of 20 celebrities. Our evaluation results show that RAD can robustly improve the accuracy of anomaly detection, reaching up to 98% for device attacks (i.e., +11%) [...] under 40% noisy labels. The proposed RAD is general and can be applied to different anomaly detection algorithms.
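
The two-layer idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the class `NearestCentroid`, the function `rad_step`, the small trusted seed set, and the rule "keep a sample only when the first layer's prediction agrees with its given label" are all assumptions made for illustration, and the toy classifier stands in for whatever anomaly detection algorithm is actually used.

```python
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple

Sample = Tuple[Sequence[float], int]  # (features, possibly noisy label)


class NearestCentroid:
    """Toy stand-in classifier: predicts the label of the closest class mean."""

    def __init__(self) -> None:
        self.centroids: Dict[int, List[float]] = {}

    def fit(self, data: Sequence[Sample]) -> "NearestCentroid":
        groups: Dict[int, List[Sequence[float]]] = defaultdict(list)
        for x, y in data:
            groups[y].append(x)
        # Per-class mean of every feature column.
        self.centroids = {
            y: [sum(col) / len(xs) for col in zip(*xs)]
            for y, xs in groups.items()
        }
        return self

    def predict(self, x: Sequence[float]) -> int:
        def sq_dist(c: Sequence[float]) -> float:
            return sum((a - b) ** 2 for a, b in zip(x, c))

        return min(self.centroids, key=lambda y: sq_dist(self.centroids[y]))


def rad_step(clean_pool: List[Sample], batch: List[Sample]):
    """One on-line round (hypothetical): filter the batch, then retrain."""
    # Layer 1: a label-quality model trained on data accepted so far.
    # Assumes clean_pool is non-empty (a small trusted seed set).
    quality_model = NearestCentroid().fit(clean_pool)
    # Assumed filtering rule: keep samples whose given label matches
    # the quality model's prediction, drop the rest as suspicious.
    kept = [(x, y) for x, y in batch if quality_model.predict(x) == y]
    # Layer 2: the anomaly detector learns from the accumulated cleaned data.
    clean_pool = clean_pool + kept
    detector = NearestCentroid().fit(clean_pool)
    return clean_pool, detector, kept


# Usage: label 0 = normal, label 1 = anomaly; the last label is flipped.
seed = [((0.0, 0.0), 0), ((0.2, 0.1), 0), ((5.0, 5.0), 1), ((5.2, 4.9), 1)]
batch = [((0.1, 0.2), 0), ((4.9, 5.1), 1), ((0.3, 0.0), 1)]
pool, detector, kept = rad_step(seed, batch)
```

Calling `rad_step` once per incoming batch mirrors the on-line setting: the accepted pool grows over time, so the filtering layer is retrained on more data at each round. The extensions named in the abstract (repeated cleaning, conflicting opinions of classifiers, oracle knowledge) are not covered by this sketch.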
