Integrated Time Series Summarization and Prediction Algorithm and its Application to COVID-19 Data Mining

by Mogens Graf Plessen, et al.

This paper proposes a simple method to extract, from a set of multiple related time series, a compressed representation for each time series based on statistics for the entire set. This is achieved by a hierarchical algorithm that first generates an alphabet of shapelets from the segmentation of centroids of clustered data. Labels of these shapelets are then assigned to the segments of each individual time series via nearest neighbor search, using unconstrained dynamic time warping as the distance measure to deal with non-uniform time series lengths. Thereby, a sequence of labels is assigned to each time series. Completion of the last label sequence permits prediction of individual time series. The proposed method is evaluated on two global COVID-19 datasets: first, the number of daily net cases (daily new infections minus daily recoveries), and second, the number of daily deaths attributed to COVID-19, as of April 27, 2020. The first dataset involves 249 time series for different countries, each of length 96. The second dataset involves 264 time series, each of length 96. Based on anomalies detected in the available data, a decentralized exit strategy from lockdowns is advocated.
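The label-assignment step described above can be illustrated with a minimal sketch. It assumes an absolute-difference local cost and a tiny hand-made alphabet; the names `dtw_distance`, `assign_labels`, `alphabet`, and `segments` are hypothetical and not taken from the paper, and the clustering and segmentation steps that would produce the shapelets are not shown.

```python
import math

def dtw_distance(a, b):
    """Unconstrained DTW distance between two 1-D sequences.

    Assumption: absolute difference as local cost; the paper's exact
    cost function is not stated in the abstract.
    """
    n, m = len(a), len(b)
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            # No warping-window constraint: all three predecessors allowed
            cost[i][j] = d + min(cost[i - 1][j],      # repeat b[j-1]
                                 cost[i][j - 1],      # repeat a[i-1]
                                 cost[i - 1][j - 1])  # advance both
    return cost[n][m]

def assign_labels(segments, alphabet):
    """Label each segment with its nearest shapelet under DTW."""
    labels = []
    for seg in segments:
        best = min(alphabet, key=lambda name: dtw_distance(seg, alphabet[name]))
        labels.append(best)
    return labels

# Hypothetical alphabet of shapelets (rising, falling, flat)
alphabet = {"A": [0, 1, 2, 3], "B": [3, 2, 1, 0], "C": [1, 1, 1]}
# Hypothetical segments of one time series, with non-uniform lengths
segments = [[0, 0, 1, 2, 3, 3], [3, 1, 0], [1, 1, 1, 1, 1]]

labels = assign_labels(segments, alphabet)
print(labels)  # ['A', 'B', 'C']
```

Because DTW aligns sequences of different lengths, the segments need not match the shapelet lengths, which is the property the abstract relies on.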







Fast Exact Dynamic Time Warping on Run-Length Encoded Time Series

Dynamic Time Warping (DTW) is a well-known similarity measure for time s...

Data mining and time series segmentation via extrema: preliminary investigations

Time series segmentation is one of the many data mining tools. This pape...

An Analytical Approach to Improving Time Warping on Multidimensional Time Series

Dynamic time warping (DTW) is one of the most used distance functions to...

The Influence of Global Constraints on Similarity Measures for Time-Series Databases

A time series consists of a series of values or events obtained over rep...

The UCR Time Series Archive

The UCR Time Series Archive - introduced in 2002, has become an importan...

Plato: Approximate Analytics over Compressed Time Series with Tight Deterministic Error Guarantees

Plato provides sound and tight deterministic error guarantees for approx...