tsrobprep - an R package for robust preprocessing of time series data

04/26/2021
by   Michał Narajewski, et al.
0

Data cleaning is a crucial part of every data analysis exercise. Yet, the currently available R packages do not provide fast and robust methods for cleaning and preparation of time series data. The open source package tsrobprep introduces efficient methods for handling missing values and outliers using model based approaches. For data imputation a probabilistic replacement model is proposed, which may consist of autoregressive components and external inputs. For outlier detection a clustering algorithm based on finite mixture modelling is introduced, which considers time series properties in terms of the gradient and the underlying seasonality as features. The procedure allows to return a probability for each observation being outlying data as well as a specific cause for an outlier assignment in terms of the provided feature space. The methods work robust and are fully tunable. Moreover, by providing the auto_data_cleaning function the data preprocessing can be carried out in one cast, without comprehensive tuning and providing suitable results. The primary motivation of the package is the preprocessing of energy system data. We present application for electricity load, wind and solar power data.

READ FULL TEXT

page 4

page 7

research
06/30/2022

K-ARMA Models for Clustering Time Series Data

We present an approach to clustering time series data using a model-base...
research
01/27/2023

fnets: An R Package for Network Estimation and Forecasting via Factor-Adjusted VAR Modelling

The package fnets for the R language implements the suite of methodologi...
research
08/18/2023

Development of a Neural Network-based Method for Improved Imputation of Missing Values in Time Series Data by Repurposing DataWig

Time series data are observations collected over time intervals. Success...
research
04/24/2023

Ordinal time series analysis with the R package otsfeatures

The 21st century has witnessed a growing interest in the analysis of tim...
research
04/24/2023

Analyzing categorical time series with the R package ctsfeatures

Time series data are ubiquitous nowadays. Whereas most of the literature...
research
04/07/2022

Robust and Explainable Autoencoders for Unsupervised Time Series Outlier Detection—Extended Version

Time series data occurs widely, and outlier detection is a fundamental p...

Please sign up or login with your details

Forgot password? Click here to reset