A framework for automated anomaly detection in high frequency water-quality data from in situ sensors

10/31/2018
by   Catherine Leigh, et al.
0

River water-quality monitoring is increasingly conducted using automated in situ sensors, enabling timelier identification of unexpected values. However, anomalies caused by technical issues confound these data, while the volume and velocity of data prevent manual detection. We present a framework for automated anomaly detection in high-frequency water-quality data from in situ sensors, using turbidity, conductivity and river level data. After identifying end-user needs and defining anomalies, we ranked their importance and selected suitable detection methods. High priority anomalies included sudden isolated spikes and level shifts, most of which were classified correctly by regression-based methods such as autoregressive integrated moving average models. However, using other water-quality variables as covariates reduced performance due to complex relationships among variables. Classification of drift and periods of anomalously low or high variability improved when we applied replaced anomalous measurements with forecasts, but this inflated false positive rates. Feature-based methods also performed well on high priority anomalies, but were also less proficient at detecting lower priority anomalies, resulting in high false negative rates. Unlike regression-based methods, all feature-based methods produced low false positive rates, but did not and require training or optimization. Rule-based methods successfully detected impossible values and missing observations. Thus, we recommend using a combination of methods to improve anomaly detection performance, whilst minimizing false detection rates. Furthermore, our framework emphasizes the importance of communication between end-users and analysts for optimal outcomes with respect to both detection performance and end-user needs. Our framework is applicable to other types of high frequency time-series data and anomaly detection applications.

READ FULL TEXT
research
03/13/2019

Crowdsourced wireless spectrum anomaly detection

Automated wireless spectrum monitoring across frequency, time and space ...
research
07/03/2023

The ROAD to discovery: machine learning-driven anomaly detection in radio astronomy spectrograms

As radio telescopes increase in sensitivity and flexibility, so do their...
research
02/17/2019

A feature-based framework for detecting technical outliers in water-quality data from in situ sensors

Outliers due to technical errors in water-quality data from in situ sens...
research
01/15/2019

MAD-GAN: Multivariate Anomaly Detection for Time Series Data with Generative Adversarial Networks

The prevalence of networked sensors and actuators in many real-world sys...
research
10/30/2018

Predicting Sediment and Nutrient Concentrations in Rivers Using High Frequency Water Quality Surrogates

A particular focus of water-quality monitoring is the concentrations of ...
research
05/06/2021

Honeyboost: Boosting honeypot performance with data fusion and anomaly detection

With cyber incidents and data breaches becoming increasingly common, bei...
research
07/13/2022

Spatial anomaly detection with optimal transport

This manuscript outlines an automated anomaly detection framework for je...

Please sign up or login with your details

Forgot password? Click here to reset