Integrated Time Series Summarization and Prediction Algorithm and its Application to COVID-19 Data Mining

05/01/2020
by   Mogens Graf Plessen, et al.
0

This paper proposes a simple method to extract from a set of multiple related time series a compressed representation for each time series based on statistics for the entire set of all time series. This is achieved by a hierarchical algorithm that first generates an alphabet of shapelets based on the segmentation of centroids for clustered data, before labels of these shapelets are assigned to the segmentation of each single time series via nearest neighbor search using unconstrained dynamic time warping as distance measure to deal with non-uniform time series lenghts. Thereby, a sequence of labels is assigned for each time series. Completion of the last label sequence permits prediction of individual time series. Proposed method is evaluated on two global COVID-19 datasets, first, for the number of daily net cases (daily new infections minus daily recoveries), and, second, for the number of daily deaths attributed to COVID-19 as of April 27, 2020. The first dataset involves 249 time series for different countries, each of length 96. The second dataset involves 264 time series, each of length 96. Based on detected anomalies in available data a decentralized exit strategy from lockdowns is advocated.

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 1

page 10

03/07/2019

Fast Exact Dynamic Time Warping on Run-Length Encoded Time Series

Dynamic Time Warping (DTW) is a well-known similarity measure for time s...
09/02/2020

Data mining and time series segmentation via extrema: preliminary investigations

Time series segmentation is one of the many data mining tools. This pape...
11/27/2018

An Analytical Approach to Improving Time Warping on Multidimensional Time Series

Dynamic time warping (DTW) is one of the most used distance functions to...
07/01/2011

The Influence of Global Constraints on Similarity Measures for Time-Series Databases

A time series consists of a series of values or events obtained over rep...
10/17/2018

The UCR Time Series Archive

The UCR Time Series Archive - introduced in 2002, has become an importan...
08/14/2018

Plato: Approximate Analytics over Compressed Time Series with Tight Deterministic Error Guarantees

Plato provides sound and tight deterministic error guarantees for approx...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.