Online Detection of Sparse Changes in High-Dimensional Data Streams Using Tailored Projections

08/06/2019
by Martin Tveten, et al.

When applying principal component analysis (PCA) for dimension reduction, the most varying projections are usually retained in order to preserve most of the information. For the purpose of anomaly and change detection, however, the least varying projections are often the most important ones. In this article, we present a novel method that automatically tailors the choice of projections to monitor for sparse changes in the mean and/or covariance matrix of high-dimensional data. A subset of the least varying projections is almost always selected, based on a criterion measuring each projection's sensitivity to changes. Our focus is on online (sequential) change detection, where the aim is to detect changes as quickly as possible while controlling false alarms at a specified level. A combination of tailored PCA and a generalized log-likelihood monitoring procedure displays high efficiency in detecting even very sparse changes in the mean, variance and correlation. We demonstrate on real data that tailored PCA monitoring remains efficient for sparse change detection even when the data streams are highly auto-correlated and non-normal. Notably, error control is achieved without the large validation set that most existing methods require.
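The claim that the least varying projections matter most for change detection can be illustrated with a small calculation. The sketch below is not the authors' sensitivity criterion or their monitoring procedure; it uses a simpler stand-in measure, the squared mean shift of each principal projection divided by that projection's variance. The covariance structure (10 streams, pairwise correlation 0.7), the size of the shift, and the helper name `standardized_shifts` are all assumptions chosen for illustration.

```python
import numpy as np

def standardized_shifts(cov, shift):
    """Squared mean shift of each principal projection, in units of its own variance.

    Illustrative stand-in measure, not the paper's sensitivity criterion.
    Returns (eigenvalues in ascending order, squared standardized shifts).
    """
    eigvals, eigvecs = np.linalg.eigh(cov)  # ascending eigenvalues, eigenvectors as columns
    projected_shift = eigvecs.T @ shift     # shift seen along each principal axis
    return eigvals, projected_shift**2 / eigvals

# Hypothetical setup: 10 streams with unit variance and pairwise correlation 0.7.
d, rho = 10, 0.7
cov = (1 - rho) * np.eye(d) + rho * np.ones((d, d))

# Sparse change: the mean of a single stream moves by one standard deviation.
shift = np.zeros(d)
shift[0] = 1.0

eigvals, sens = standardized_shifts(cov, shift)
least_varying_total = sens[:-1].sum()  # the d - 1 smallest-variance projections
most_varying = sens[-1]                # the single largest-variance projection
print(f"least varying: {least_varying_total:.3f}, most varying: {most_varying:.3f}")
```

Under these assumptions the nine least varying projections together carry a standardized shift of about 3.0, while the most varying projection carries about 0.014, so a monitor that keeps only the top component would barely register this sparse change.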

research
05/15/2019

Which principal components are most sensitive to distributional changes?

PCA is often used in anomaly detection and statistical process control t...
research
04/11/2022

On unsupervised projections and second order signals

Linear projections are widely used in the analysis of high-dimensional d...
research
09/14/2017

Generalized Biplots for Multidimensional Scaled Projections

Dimension reduction and visualization is a staple of data analytics. Met...
research
06/22/2023

Adaptive Bernstein Change Detector for High-Dimensional Data Streams

Change detection is of fundamental importance when analyzing data stream...
research
12/14/2017

A Two-stage Online Monitoring Procedure for High-Dimensional Data Streams

Advanced computing and data acquisition technologies have made possible ...
research
09/22/2020

Partially Observable Online Change Detection via Smooth-Sparse Decomposition

We consider online change detection of high dimensional data streams wit...
research
04/20/2019

High Dimensional Process Monitoring Using Robust Sparse Probabilistic Principal Component Analysis

High dimensional data has introduced challenges that are difficult to ad...