Estimating Extreme Value Index by Subsampling for Massive Datasets with Heavy-Tailed Distributions

07/04/2020
by Yongxin Li, et al.

Modern statistical analyses often encounter massive datasets with heavy-tailed distributions. When a dataset is this large, traditional estimation methods can hardly be applied to estimate the extreme value index directly. To address this issue, we propose a subsampling-based method. Specifically, multiple subsamples are drawn from the whole dataset by simple random subsampling with replacement. An approximate maximum likelihood estimator is computed from each subsample, and the resulting estimators are then averaged to form a more accurate one. Under appropriate regularity conditions, we show theoretically that the proposed estimator is consistent and asymptotically normal. With the estimated extreme value index, a normal range can be established for a heavy-tailed random variable. Observations that fall outside this range should be treated as suspected records and can in practice be regarded as outliers. Extensive simulation experiments demonstrate the promising performance of our method. A real data analysis is also presented for illustration.
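The subsample-then-average idea described in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the approximate maximum likelihood estimator, so the classical Hill estimator is swapped in as a stand-in, and the function names, the number of subsamples, the subsample size, and the tail cutoff k are all hypothetical choices.

```python
import math
import random


def hill_estimator(sample, k):
    """Hill estimator of the extreme value index from the k largest values.

    Assumption: used here as a stand-in for the paper's approximate MLE.
    Requires positive data and 0 < k < len(sample).
    """
    ordered = sorted(sample)
    threshold = ordered[-k - 1]  # (k+1)-th largest value acts as the tail threshold
    return sum(math.log(x / threshold) for x in ordered[-k:]) / k


def subsampled_evi(data, n_subsamples, subsample_size, k, seed=0):
    """Average per-subsample estimates over subsamples drawn with replacement."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_subsamples):
        # Simple random subsampling with replacement from the full dataset.
        sub = rng.choices(data, k=subsample_size)
        estimates.append(hill_estimator(sub, k))
    return sum(estimates) / len(estimates)


# Synthetic heavy-tailed data: classical Pareto with shape 2,
# so the true extreme value index is 1 / 2 = 0.5.
gen = random.Random(42)
true_gamma = 0.5
data = [gen.paretovariate(1 / true_gamma) for _ in range(100_000)]

gamma_hat = subsampled_evi(data, n_subsamples=50, subsample_size=2_000, k=200)
print(f"estimated extreme value index: {gamma_hat:.3f}")
```

Each subsample is small enough to sort and process cheaply, which is the practical point of the approach: the full dataset is only ever sampled from, never sorted as a whole. How many subsamples to draw and how large to make them are tuning choices that this sketch does not attempt to justify.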

research
10/16/2021

A Reduced-Bias Weighted least square estimation of the Extreme Value Index

In this paper, we propose a reduced-bias estimator of the EVI for Pareto...
research
05/12/2021

Trimmed extreme value estimators for censored heavy-tailed data

We consider estimation of the extreme value index and extreme quantiles ...
research
04/13/2023

Subsampling and Jackknifing: A Practically Convenient Solution for Large Data Analysis with Limited Computational Resources

Modern statistical analysis often encounters datasets with large sizes. ...
research
08/23/2018

Data-adaptive trimming of the Hill estimator and detection of outliers in the extremes of heavy-tailed data

We introduce a trimmed version of the Hill estimator for the index of a ...
research
12/20/2017

Extreme Value Analysis Without the Largest Values: What Can Be Done?

In this paper we are concerned with the analysis of heavy-tailed data wh...
research
10/05/2022

Extreme expectile estimation for short-tailed data

The use of expectiles in risk management contexts has recently gathered ...
research
07/07/2023

Hill estimator and extreme quantile estimator for functionals of approximated stochastic processes

We study the effect of approximation errors in assessing the extreme beh...
