A new tidy data structure to support exploration and modeling of temporal data

01/29/2019
by   Earo Wang, et al.
0

Mining temporal data for information is often inhibited by a multitude of formats: irregular or multiple time intervals, point events that need aggregating, multiple observational units or repeated measurements on multiple individuals, and heterogeneous data types. On the other hand, the software supporting time series modeling and forecasting, makes strict assumptions on the data to be provided, typically requiring a matrix of numeric data with implicit time indexes. Going from raw data to model-ready data is painful. This work presents a cohesive and conceptual framework for organizing and manipulating temporal data, which in turn flows into visualization, modeling and forecasting routines. Tidy data principles are extended to temporal data by: (1) mapping the semantics of a dataset into its physical layout; (2) including an explicitly declared index variable representing time; (3) incorporating a "key" comprising single or multiple variables to uniquely identify units over time. This tidy data representation most naturally supports thinking of operations on the data as building blocks, forming part of a "data pipeline" in time-based contexts. A sound data pipeline facilitates a fluent workflow for analyzing temporal data. The infrastructure of tidy temporal data has been implemented in the R package "tsibble".

READ FULL TEXT

page 19

page 20

page 21

page 23

page 24

research
04/07/2022

Few-Shot Forecasting of Time-Series with Heterogeneous Channels

Learning complex time series forecasting models usually requires a large...
research
12/29/2021

AutoFITS: Automatic Feature Engineering for Irregular Time Series

A time series represents a set of observations collected over time. Typi...
research
12/21/2021

AutoCTS: Automated Correlated Time Series Forecasting – Extended Version

Correlated time series (CTS) forecasting plays an essential role in many...
research
08/28/2020

Pay Attention to Evolution: Time Series Forecasting with Deep Graph-Evolution Learning

Time-series forecasting is one of the most active research topics in pre...
research
01/30/2021

Conceptual Temporal Modeling Applied to Databases

We present a different approach to developing a concept of time for spec...
research
05/19/2022

Extract Dynamic Information To Improve Time Series Modeling: a Case Study with Scientific Workflow

In modeling time series data, we often need to augment the existing data...

Please sign up or login with your details

Forgot password? Click here to reset