Gradient boosting for extreme quantile regression

03/01/2021
by   Jasper Velthoen, et al.
0

Extreme quantile regression provides estimates of conditional quantiles outside the range of the data. Classical methods such as quantile random forests perform poorly in such cases since data in the tail region are too scarce. Extreme value theory motivates to approximate the conditional distribution above a high threshold by a generalized Pareto distribution with covariate dependent parameters. This model allows for extrapolation beyond the range of observed values and estimation of conditional extreme quantiles. We propose a gradient boosting procedure to estimate a conditional generalized Pareto distribution by minimizing its deviance. Cross-validation is used for the choice of tuning parameters such as the number of trees and the tree depths. We discuss diagnostic plots such as variable importance and partial dependence plots, which help to interpret the fitted models. In simulation studies we show that our gradient boosting procedure outperforms classical methods from quantile regression and extreme value theory, especially for high-dimensional predictor spaces and complex parameter response surfaces. An application to statistical post-processing of weather forecasts with precipitation data in the Netherlands is proposed.

READ FULL TEXT

page 27

page 31

research
01/30/2022

Extremal Random Forests

Classical methods for quantile regression fail in cases where the quanti...
research
04/17/2019

Estimation and uncertainty quantification for extreme quantile regions

Estimation of extreme quantile regions, spaces in which future extreme e...
research
12/09/2018

Constant versus Covariate Dependent Threshold in the Peaks-Over Threshold Method

The Peaks-Over Threshold is a fundamental method in the estimation of ra...
research
10/18/2021

Gradient boosting with extreme-value theory for wildfire prediction

This paper details the approach of the team Kohrrelation in the 2021 Ext...
research
12/03/2019

Risk-Averse Action Selection Using Extreme Value Theory Estimates of the CVaR

The Conditional Value-at-Risk (CVaR) is a useful risk measure in machine...
research
01/29/2020

On the behavior of extreme d-dimensional spatial quantiles under minimal assumptions

"Spatial" or "geometric" quantiles are the only multivariate quantiles c...
research
12/15/2022

Learning Inter-Annual Flood Loss Risk Models From Historical Flood Insurance Claims and Extreme Rainfall Data

Flooding is one of the most disastrous natural hazards, responsible for ...

Please sign up or login with your details

Forgot password? Click here to reset