An efficient aggregation method for the symbolic representation of temporal data

01/14/2022
by   Xinye Chen, et al.
0

Symbolic representations are a useful tool for the dimension reduction of temporal data, allowing for the efficient storage of and information retrieval from time series. They can also enhance the training of machine learning algorithms on time series data through noise reduction and reduced sensitivity to hyperparameters. The adaptive Brownian bridge-based aggregation (ABBA) method is one such effective and robust symbolic representation, demonstrated to accurately capture important trends and shapes in time series. However, in its current form the method struggles to process very large time series. Here we present a new variant of the ABBA method, called fABBA. This variant utilizes a new aggregation approach tailored to the piecewise representation of time series. By replacing the k-means clustering used in ABBA with a sorting-based aggregation technique, and thereby avoiding repeated sum-of-squares error computations, the computational complexity is significantly reduced. In contrast to the original method, the new approach does not require the number of time series symbols to be specified in advance. Through extensive tests we demonstrate that the new method significantly outperforms ABBA with a considerable reduction in runtime while also outperforming the popular SAX and 1d-SAX representations in terms of reconstruction accuracy. We further demonstrate that fABBA can compress other data types such as images.

READ FULL TEXT
research
03/27/2020

ABBA: Adaptive Brownian bridge-based symbolic aggregation of time series

A new symbolic representation of time series, called ABBA, is introduced...
research
02/08/2023

ASTRIDE: Adaptive Symbolization for Time Series Databases

We introduce ASTRIDE (Adaptive Symbolization for Time seRIes DatabasEs),...
research
03/12/2020

Time Series Forecasting Using LSTM Networks: A Symbolic Approach

Machine learning methods trained on raw numerical time series data exhib...
research
09/02/2021

MrSQM: Fast Time Series Classification with Symbolic Representations

Symbolic representations of time series have proven to be effective for ...
research
09/26/2017

Symbolic Analysis-based Reduced Order Markov Modeling of Time Series Data

This paper presents a technique for reduced-order Markov modeling for co...
research
08/01/2017

Impact of different time series aggregation methods on optimal energy system design

Modelling renewable energy systems is a computationally-demanding task d...
research
04/09/2021

Granger Causality Based Hierarchical Time Series Clustering for State Estimation

Clustering is an unsupervised learning technique that is useful when wor...

Please sign up or login with your details

Forgot password? Click here to reset