Testing Tail Weight of a Distribution Via Hazard Rate

10/06/2020
by   Maryam Aliakbarpour, et al.
0

Understanding the shape of a distribution of data is of interest to people in a great variety of fields, as it may affect the types of algorithms used for that data. Given samples from a distribution, we seek to understand how many elements appear infrequently, that is, to characterize the tail of the distribution. We develop an algorithm based on a careful bucketing scheme that distinguishes heavy-tailed distributions from non-heavy-tailed ones via a definition based on the hazard rate under some natural smoothness and ordering assumptions. We verify our theoretical results empirically.

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 1

page 2

page 3

page 4

05/23/2021

Compressing Heavy-Tailed Weight Matrices for Non-Vacuous Generalization Bounds

Heavy-tailed distributions have been studied in statistics, random matri...
05/09/2014

Gaussian-Chain Filters for Heavy-Tailed Noise with Application to Detecting Big Buyers and Big Sellers in Stock Market

We propose a new heavy-tailed distribution --- Gaussian-Chain (GC) distr...
01/26/2021

Ethereum ECCPoW

The error-correction code based proof-of-work (ECCPoW) algorithm is base...
04/26/2022

An Empirical Study of the Occurrence of Heavy-Tails in Training a ReLU Gate

A particular direction of recent advance about stochastic deep-learning ...
02/15/2019

Heavy-tailed kernels reveal a finer cluster structure in t-SNE visualisations

T-distributed stochastic neighbour embedding (t-SNE) is a widely used da...
08/13/2019

L2P: An Algorithm for Estimating Heavy-tailed Outcomes

Many real-world prediction tasks have outcome (a.k.a. target or response...
01/20/2022

Heavy-tailed Sampling via Transformed Unadjusted Langevin Algorithm

We analyze the oracle complexity of sampling from polynomially decaying ...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.