A LightGBM based Forecasting of Dominant Wave Periods in Oceanic Waters

05/18/2021
by   Pujan Pokhrel, et al.
0

In this paper, we propose a Light Gradient Boosting (LightGBM) to forecast dominant wave periods in oceanic waters. First, we use the data collected from CDIP buoys and apply various data filtering methods. The data filtering methods allow us to obtain a high-quality dataset for training and validation purposes. We then extract various wave-based features like wave heights, periods, skewness, kurtosis, etc., and atmospheric features like humidity, pressure, and air temperature for the buoys. Afterward, we train algorithms that use LightGBM and Extra Trees through a hv-block cross-validation scheme to forecast dominant wave periods for up to 30 days ahead. LightGBM has the R2 score of 0.94, 0.94, and 0.94 for 1-day ahead, 15-day ahead, and 30-day ahead prediction. Similarly, Extra Trees (ET) has an R2 score of 0.88, 0.86, and 0.85 for 1-day ahead, 15-day ahead, and 30 day ahead prediction. In case of the test dataset, LightGBM has R2 score of 0.94, 0.94, and 0.94 for 1-day ahead, 15-day ahead and 30-day ahead prediction. ET has R2 score of 0.88, 0.86, and 0.85 for 1-day ahead, 15-day ahead, and 30-day ahead prediction. A similar R2 score for both training and the test dataset suggests that the machine learning models developed in this paper are robust. Since the LightGBM algorithm outperforms ET for all the windows tested, it is taken as the final algorithm. Note that the performance of both methods does not decrease significantly as the forecast horizon increases. Likewise, the proposed method outperforms the numerical approaches included in this paper in the test dataset. For 1 day ahead prediction, the proposed algorithm has SI, Bias, CC, and RMSE of 0.09, 0.00, 0.97, and 1.78 compared to 0.268, 0.40, 0.63, and 2.18 for the European Centre for Medium-range Weather Forecasts (ECMWF) model, which outperforms all the other methods in the test dataset.

READ FULL TEXT
research
05/18/2021

Forecasting Significant Wave Heights in Oceanic Waters

This paper proposes a machine learning method based on the Extra Trees (...
research
11/06/2018

Day-ahead time series forecasting: application to capacity planning

In the context of capacity planning, forecasting the evolution of inform...
research
07/13/2021

Smoothed Bernstein Online Aggregation for Day-Ahead Electricity Demand Forecasting

We present a winning method of the IEEE DataPort Competition on Day-Ahea...
research
08/13/2020

A Novel CMAQ-CNN Hybrid Model to Forecast Hourly Surface-Ozone Concentrations Fourteen Days in Advance

Issues regarding air quality and related health concerns have prompted t...
research
02/08/2020

Romance in China: Mining and Visualizing 10 Million Alibaba Valentine Purchases

Valentine Day February 14, is the day of love. The days ahead of Valenti...
research
09/29/2021

Digital Twins based Day-ahead Integrated Energy System Scheduling under Load and Renewable Energy Uncertainties

By constructing digital twins (DT) of an integrated energy system (IES),...
research
03/09/2019

Estimating Dynamic Conditional Spread Densities to Optimise Daily Storage Trading of Electricity

This paper formulates dynamic density functions, based upon skewed-t and...

Please sign up or login with your details

Forgot password? Click here to reset