Calculating the matrix profile from noisy data

06/16/2023
by Colin Hehir, et al.

The matrix profile (MP) is a data structure computed from a time series which encodes the data required to locate motifs and discords, corresponding to recurring patterns and outliers respectively. When a time series contains noisy data, the conventional approach is to pre-filter it to remove the noise, but this cannot be applied in unsupervised settings where patterns and outliers are not annotated. The resilience of the algorithm used to generate the MP when faced with noisy data remains unknown. We measure the similarity between the MP computed from the original time series data and MPs generated from the same data with noise added under a range of parameter settings, including adding duplicates and adding irrelevant data. We use three real-world data sets drawn from diverse domains for these experiments. Based on dissimilarities between the MPs, our results suggest that MP generation is resilient to a small amount of noise being introduced into the data, but as the amount of noise increases this resilience disappears.
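To make the comparison concrete, the following is a minimal sketch of the general idea, not the authors' experimental code. It assumes a brute-force self-join matrix profile under z-normalized Euclidean distance, an exclusion zone of a quarter of the window length, additive Gaussian noise (a stand-in for the duplicate and irrelevant-data noise models mentioned above, chosen because it keeps both profiles the same length), and mean absolute difference as the dissimilarity measure. All function names and parameter values are hypothetical.

```python
import math
import random


def znorm(window):
    """Z-normalize a window; a constant window maps to all zeros."""
    m = len(window)
    mean = sum(window) / m
    std = math.sqrt(sum((x - mean) ** 2 for x in window) / m)
    if std == 0.0:
        return [0.0] * m
    return [(x - mean) / std for x in window]


def matrix_profile(series, m):
    """Brute-force self-join matrix profile.

    For each length-m subsequence, returns the z-normalized Euclidean
    distance to its nearest neighbor outside an exclusion zone.
    """
    n_sub = len(series) - m + 1
    subs = [znorm(series[i:i + m]) for i in range(n_sub)]
    excl = max(1, m // 4)  # assumed exclusion-zone width to skip trivial matches
    profile = []
    for i in range(n_sub):
        best = math.inf
        for j in range(n_sub):
            if abs(i - j) <= excl:
                continue
            d = math.sqrt(sum((a - b) ** 2 for a, b in zip(subs[i], subs[j])))
            best = min(best, d)
        profile.append(best)
    return profile


def mean_abs_difference(p, q):
    """One possible dissimilarity between two equal-length profiles (assumed)."""
    return sum(abs(a - b) for a, b in zip(p, q)) / len(p)


def add_noise(series, sigma, rng):
    """Additive Gaussian noise; an illustrative noise model only."""
    return [x + rng.gauss(0.0, sigma) for x in series]


rng = random.Random(0)
clean = [math.sin(2 * math.pi * t / 25) for t in range(200)]  # synthetic series
mp_clean = matrix_profile(clean, m=25)

# Compare the clean profile with profiles computed at increasing noise levels.
for sigma in (0.01, 0.1, 0.5):
    mp_noisy = matrix_profile(add_noise(clean, sigma, rng), m=25)
    print(sigma, round(mean_abs_difference(mp_clean, mp_noisy), 3))
```

The brute-force loop is quadratic in the number of subsequences, so it is only suitable for short series like this synthetic one. A dedicated matrix profile library would be the practical choice for real data sets.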

