Concentration bounds for the empirical angular measure with statistical learning applications

04/07/2021
by Stephan Clémençon, et al.
The angular measure on the unit sphere characterizes the first-order dependence structure of the components of a random vector in extreme regions and is defined in terms of standardized margins. Its statistical recovery is an important step in learning problems involving observations far away from the center. In the common situation when the components of the vector have different distributions, the rank transformation offers a convenient and robust way of standardizing data in order to build an empirical version of the angular measure based on the most extreme observations. However, the study of the sampling distribution of the resulting empirical angular measure is challenging. It is the purpose of the paper to establish finite-sample bounds for the maximal deviations between the empirical and true angular measures, uniformly over classes of Borel sets of controlled combinatorial complexity. The bounds are valid with high probability and scale essentially as the inverse square root of the effective sample size, up to a logarithmic factor. Discarding the most extreme observations yields a truncated version of the empirical angular measure for which the logarithmic factor in the concentration bound is replaced by a factor depending on the truncation level. The bounds are applied to provide performance guarantees for two statistical learning procedures tailored to extreme regions of the input space and built upon the empirical angular measure: binary classification in extreme regions through empirical risk minimization and unsupervised anomaly detection through minimum-volume sets of the sphere.
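To make the rank-based construction described in the abstract concrete, here is a minimal sketch in plain Python. It is not taken from the paper: the function names, the rank convention (rank divided by n + 1), the use of the sup-norm, and the rule of keeping the k points with the largest norm are all assumptions made for illustration, and the paper's exact definitions may differ.

```python
import random


def rank_standardize(rows):
    """Rank-transform each coordinate to approximately unit-Pareto margins."""
    n, d = len(rows), len(rows[0])
    out = [[0.0] * d for _ in range(n)]
    for j in range(d):
        # Indices of the observations sorted by their j-th coordinate.
        order = sorted(range(n), key=lambda i: rows[i][j])
        for rank, i in enumerate(order, start=1):
            # Assumed convention: F_hat = rank / (n + 1), so 1 - F_hat > 0.
            out[i][j] = 1.0 / (1.0 - rank / (n + 1.0))
    return out


def extreme_angles(rows, k):
    """Angles of the k standardized points with the largest sup-norm."""
    # All standardized coordinates are positive, so max(v) is the sup-norm.
    scored = [(max(v), v) for v in rank_standardize(rows)]
    scored.sort(key=lambda t: t[0])
    return [[x / r for x in v] for r, v in scored[-k:]]


def empirical_angular_mass(angles, in_set):
    """Fraction of the extreme angles that fall in the set given by in_set."""
    return sum(1 for w in angles if in_set(w)) / len(angles)


# Hypothetical data: two independent components with different margins.
random.seed(0)
data = [[random.gauss(0, 1), random.expovariate(1.0)] for _ in range(1000)]
angles = extreme_angles(data, k=50)
# Empirical mass of the region where both coordinates are simultaneously large.
mass = empirical_angular_mass(angles, lambda w: min(w) >= 0.5)
```

Here k plays the role of the effective sample size mentioned in the abstract: only the k most extreme standardized observations contribute to the estimate, so it is k, not the full sample size, that drives the accuracy.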


research
05/26/2023

An asymptotic expansion of the empirical angular measure for bivariate extremal dependence

The angular measure on the unit sphere characterizes the first-order dep...
research
03/31/2016

Sparse Representation of Multivariate Extremes with Applications to Anomaly Ranking

Extremes play a special role in Anomaly Detection. Beyond inference and ...
research
09/10/2023

Testing the Regular Variation Model for Multivariate Extremes with Flexible Circular and Spherical Distributions

The regular variation model for multivariate extremes decomposes the joi...
research
03/06/2023

On Regression in Extreme Regions

In the classic regression problem, the value of a real-valued random var...
research
11/15/2021

Spectral learning of multivariate extremes

We propose a spectral clustering algorithm for analyzing the dependence ...
research
06/26/2019

Principal Component Analysis for Multivariate Extremes

The first order behavior of multivariate heavy-tailed random vectors abo...
research
10/25/2021

A Constructive Proof of the Glivenko-Cantelli Theorem

The Glivenko-Cantelli theorem states that the empirical distribution fun...
