An Interpretable Probabilistic Model for Short-Term Solar Power Forecasting Using Natural Gradient Boosting

08/05/2021
by Georgios Mitrentsis, et al.

The stochastic nature of photovoltaic (PV) power has led both academia and industry to devote a large amount of research to the development of accurate PV power forecasting models. However, most of those models are based on machine learning algorithms and are considered black boxes that provide no insight into or explanation of their predictions. Therefore, their direct implementation in environments where transparency is required, and the trust associated with their predictions, may be questioned. To this end, we propose a two-stage probabilistic forecasting framework able to generate highly accurate, reliable, and sharp forecasts while offering full transparency on both the point forecasts and the prediction intervals (PIs). In the first stage, we exploit natural gradient boosting (NGBoost) to yield probabilistic forecasts, while in the second stage, we calculate the Shapley additive explanation (SHAP) values in order to fully understand why a prediction was made. To highlight the performance and the applicability of the proposed framework, real data from two PV parks located in Southern Germany are employed. First, NGBoost is thoroughly compared with two state-of-the-art algorithms, namely Gaussian process and lower upper bound estimation, across a wide range of forecasting metrics. Second, a detailed analysis of the model's complex nonlinear relationships and interaction effects between the various features is presented. This analysis allows us to interpret the model, identify some learned physical properties, explain individual predictions, reduce the computational requirements for training without jeopardizing the model accuracy, detect possible bugs, and gain trust in the model. Finally, we conclude that the model was able to develop nonlinear relationships that follow human logic and intuition based on learned physical properties.
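The two-stage idea can be illustrated with a minimal, self-contained sketch. It is not the authors' implementation and uses neither the NGBoost nor the SHAP library: a hypothetical hand-written function `toy_forecast` stands in for a trained probabilistic model that returns the mean and standard deviation of a Gaussian predictive distribution, and Shapley values are computed by exact enumeration against a hypothetical baseline input rather than with the approximations a SHAP explainer would use. The feature names, coefficients, baseline, and the 90% interval level are all assumptions chosen for illustration.

```python
from itertools import combinations
from math import factorial
from statistics import NormalDist


def toy_forecast(x):
    """Hypothetical stand-in for a trained probabilistic model.

    Maps (irradiance, cloud_cover, temperature) to the mean and standard
    deviation of a Gaussian predictive distribution. Coefficients are made up.
    """
    irradiance, cloud_cover, temperature = x
    mean = 0.8 * irradiance * (1.0 - 0.5 * cloud_cover) - 0.01 * temperature
    std = 0.05 + 0.2 * cloud_cover * irradiance
    return mean, std


def exact_shapley(value_fn, x, baseline):
    """Exact Shapley values of value_fn at x relative to a baseline input.

    Features outside a coalition are replaced by their baseline value.
    Cost grows exponentially with the number of features, so this is only
    practical for toy examples.
    """
    n = len(x)
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Shapley weight for coalitions of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                with_i = [x[j] if (j in subset or j == i) else baseline[j]
                          for j in range(n)]
                without_i = [x[j] if j in subset else baseline[j]
                             for j in range(n)]
                phi[i] += weight * (value_fn(with_i) - value_fn(without_i))
    return phi


def prediction_interval(mean, std, level=0.9):
    """Central prediction interval of a Gaussian predictive distribution."""
    dist = NormalDist(mean, std)
    alpha = 1.0 - level
    return dist.inv_cdf(alpha / 2.0), dist.inv_cdf(1.0 - alpha / 2.0)


# Stage 1 (assumed): a probabilistic forecast for one hypothetical input.
x = [0.9, 0.3, 25.0]          # irradiance, cloud cover, temperature
baseline = [0.5, 0.5, 15.0]   # hypothetical reference input
mean, std = toy_forecast(x)
lower, upper = prediction_interval(mean, std)

# Stage 2 (assumed): attribute both the point forecast and its uncertainty.
phi_mean = exact_shapley(lambda v: toy_forecast(v)[0], x, baseline)
phi_std = exact_shapley(lambda v: toy_forecast(v)[1], x, baseline)

print("interval:", (lower, upper))
print("attributions for the mean:", phi_mean)
print("attributions for the std:", phi_std)
```

Explaining the standard deviation separately from the mean mirrors the abstract's aim of transparency on both the point forecasts and the PIs: in this toy model, temperature shifts the mean but receives zero attribution for the interval width, because it does not enter the `std` term.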


