Comparison of tree-based ensemble algorithms for merging satellite and earth-observed precipitation data at the daily time scale

Merging satellite products and ground-based measurements is often required for obtaining precipitation datasets that simultaneously cover large regions with high density and are more accurate than pure satellite precipitation products. Machine and statistical learning regression algorithms are regularly utilized in this endeavour. At the same time, tree-based ensemble algorithms are adopted in various fields for solving regression problems with high accuracy and low computational cost. Still, information on which tree-based ensemble algorithm to select for correcting satellite precipitation products for the contiguous United States (US) at the daily time scale is missing from the literature. In this study, we worked towards filling this methodological gap by conducting an extensive comparison between three algorithms of the category of interest, specifically between random forests, gradient boosting machines (gbm) and extreme gradient boosting (XGBoost). We used daily data from the PERSIANN (Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks) and the IMERG (Integrated Multi-satellitE Retrievals for GPM) gridded datasets. We also used earth-observed precipitation data from the Global Historical Climatology Network daily (GHCNd) database. The experiments referred to the entire contiguous US and additionally included the application of the linear regression algorithm for benchmarking purposes. The results suggest that XGBoost is the best-performing tree-based ensemble algorithm among those compared...

READ FULL TEXT

page 6

page 7

page 12

page 16

research
12/17/2022

Comparison of machine learning algorithms for merging gridded satellite and earth-observed precipitation data

Gridded satellite precipitation datasets are useful in hydrological appl...
research
07/09/2023

Ensemble learning for blending gridded satellite and gauge-measured precipitation data

Regression algorithms are regularly used for improving the accuracy of s...
research
04/11/2013

Merging Satellite Measurements of Rainfall Using Multi-scale Imagery Technique

Several passive microwave satellites orbit the Earth and measure rainfal...
research
06/04/2017

InfiniteBoost: building infinite ensembles with gradient descent

In machine learning ensemble methods have demonstrated high accuracy for...
research
06/03/2021

Gradient Boosted Binary Histogram Ensemble for Large-scale Regression

In this paper, we propose a gradient boosting algorithm for large-scale ...
research
09/09/2019

Super learning for daily streamflow forecasting: Large-scale demonstration and comparison with multiple machine learning algorithms

Daily streamflow forecasting through data-driven approaches is tradition...
research
08/16/2021

Nowcasting-Nets: Deep Neural Network Structures for Precipitation Nowcasting Using IMERG

Accurate and timely estimation of precipitation is critical for issuing ...

Please sign up or login with your details

Forgot password? Click here to reset