CASC: Context-Aware Segmentation and Clustering for Motif Discovery in Noisy Time Series Data

09/06/2018
by   Saachi Jain, et al.
0

Complex systems, such as airplanes, cars, or financial markets, produce multivariate time series data consisting of system observations over a period of time. Such data can be interpreted as a sequence of segments, where each segment is associated with a certain state of the system. An important problem in this domain is to identify repeated sequences of states, known as motifs. Such motifs correspond to complex behaviors that capture common sequences of state transitions. For example, a motif of "making a turn" might manifest in sensor data as a sequence of states: slowing down, turning the wheel, and then speeding back up. However, discovering these motifs is challenging, because the individual states are unknown and need to be learned from the noisy time series. Simultaneously, the time series also needs to be precisely segmented and each segment needs to be associated with a state. Here we develop context-aware segmentation and clustering (CASC), a method for discovering common motifs in time series data. We formulate the problem of motif discovery as a large optimization problem, which we then solve using a greedy alternating minimization-based approach. CASC performs well in the presence of noise in the input data and is scalable to very large datasets. Furthermore, CASC leverages common motifs to more robustly segment the time series and assign segments to states. Experiments on synthetic data show that CASC outperforms state-of-the-art baselines by up to 38.2 our approach discovers insightful motifs in real-world time series data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/27/2020

Bayesian nonparametric shared multi-sequence time series segmentation

In this paper, we introduce a method for segmenting time series data usi...
research
01/02/2021

Optimal Segmented Linear Regression for Financial Time Series Segmentation

Given a financial time series data, one of the most fundamental and inte...
research
06/29/2020

Neural Time Warping For Multiple Sequence Alignment

Multiple sequences alignment (MSA) is a traditional and challenging task...
research
05/10/2019

Capturing Evolution Genes for Time Series Data

The modeling of time series is becoming increasingly critical in a wide ...
research
12/07/2018

seq2graph: Discovering Dynamic Dependencies from Multivariate Time Series with Multi-level Attention

Discovering temporal lagged and inter-dependencies in multivariate time ...
research
11/20/2022

TSEXPLAIN: Explaining Aggregated Time Series by Surfacing Evolving Contributors

Aggregated time series are generated effortlessly everywhere, e.g., "tot...
research
05/29/2019

Flexible Mining of Prefix Sequences from Time-Series Traces

Mining temporal assertions from time-series data using information theor...

Please sign up or login with your details

Forgot password? Click here to reset