On Tail Decay Rate Estimation of Loss Function Distributions

06/05/2023
by   Etrit Haxholli, et al.
0

The study of loss function distributions is critical to characterize a model's behaviour on a given machine learning problem. For example, while the quality of a model is commonly determined by the average loss assessed on a testing set, this quantity does not reflect the existence of the true mean of the loss distribution. Indeed, the finiteness of the statistical moments of the loss distribution is related to the thickness of its tails, which are generally unknown. Since typical cross-validation schemes determine a family of testing loss distributions conditioned on the training samples, the total loss distribution must be recovered by marginalizing over the space of training sets. As we show in this work, the finiteness of the sampling procedure negatively affects the reliability and efficiency of classical tail estimation methods from the Extreme Value Theory, such as the Peaks-Over-Threshold approach. In this work we tackle this issue by developing a novel general theory for estimating the tails of marginal distributions, when there exists a large variability between locations of the individual conditional distributions underlying the marginal. To this end, we demonstrate that under some regularity conditions, the shape parameter of the marginal distribution is the maximum tail shape parameter of the family of conditional distributions. We term this estimation approach as Cross Tail Estimation (CTE). We test cross-tail estimation in a series of experiments on simulated and real data, showing the improved robustness and quality of tail estimation as compared to classical approaches, and providing evidence for the relationship between overfitting and loss distribution tail thickness.

READ FULL TEXT
research
05/25/2021

On the Tail Behaviour of Aggregated Random Variables

In many areas of interest, modern risk assessment requires estimation of...
research
05/25/2018

Body and Tail - Separating the distribution function by an efficient tail-detecting procedure in risk management

In risk management, tail risks are of crucial importance. The quality of...
research
03/28/2018

Repeated out of Sample Fusion in the Estimation of Small Tail Probabilities

In pursuit of a small tail probability p, it is shown how to construct b...
research
05/16/2022

Fat-Tailed Variational Inference with Anisotropic Tail Adaptive Flows

While fat-tailed densities commonly arise as posterior and marginal dist...
research
02/04/2018

INLA goes extreme: Bayesian tail regression for the estimation of high spatio-temporal quantiles

This work has been motivated by the challenge of the 2017 conference on ...
research
06/21/2023

Modile as a conservative tail risk measurer: the solution of an optimisation problem with 0-1 loss function

Quantiles and expectiles, which are two important concepts and tools in ...
research
09/08/2020

Empirical Strategy for Stretching Probability Distribution in Neural-network-based Regression

In regression analysis under artificial neural networks, the prediction ...

Please sign up or login with your details

Forgot password? Click here to reset