Forecasting with sktime: Designing sktime's New Forecasting API and Applying It to Replicate and Extend the M4 Study

05/16/2020
by   Markus Löning, et al.
1

We present a new open-source framework for forecasting in Python. Our framework forms part of sktime, a machine learning toolbox with a unified interface for different time series learning tasks, like forecasting, but also time series classification and regression. We provide a dedicated forecasting interface, common statistical algorithms, and scikit-learn compatible tools for building composite machine learning models. We use sktime to both replicate key results from the M4 forecasting study and to extend it. sktime allows to easily build, tune and evaluate new models. We investigate the potential of common machine learning techniques for univariate forecasting, including reduction, boosting, ensembling, pipelining and tuning. We find that simple hybrid models can boost the performance of statistical models, and that pure machine learning models can achieve competitive forecasting performance on the hourly data sets, outperforming the statistical algorithms and coming close to the M4 winner model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/17/2019

sktime: A Unified Interface for Machine Learning with Time Series

We present sktime -- a new scikit-learn compatible Python library with a...
research
03/13/2023

Large statistical learning models effectively forecast diverse chaotic systems

Chaos and unpredictability are traditionally synonymous, yet recent adva...
research
09/15/2016

cesium: Open-Source Platform for Time-Series Inference

Inference on time series data is a common requirement in many scientific...
research
08/31/2023

Forecasting Emergency Department Crowding with Advanced Machine Learning Models and Multivariable Input

Emergency department (ED) crowding is a significant threat to patient sa...
research
09/09/2018

Performance Metrics (Error Measures) in Machine Learning Regression, Forecasting and Prognostics: Properties and Typology

Performance metrics (error measures) are vital components of the evaluat...
research
02/08/2022

The Lifecycle of a Statistical Model: Model Failure Detection, Identification, and Refitting

The statistical machine learning community has demonstrated considerable...
research
10/07/2021

Darts: User-Friendly Modern Machine Learning for Time Series

We present Darts, a Python machine learning library for time series, wit...

Please sign up or login with your details

Forgot password? Click here to reset