Histograms lie about distribution shapes and Pearson's coefficient of variation lies about variability

11/12/2021
by Paulo S. P. Silveira, et al.

Background and Objective: Histograms and Pearson's coefficient of variation are among the most popular summary statistics. Researchers judge the shape of a quantitative data distribution by visual inspection of histograms and take the coefficient of variation as an estimator of the relative variability of the data. We explore the properties of histograms and of the coefficient of variation through examples in R and offer better alternatives: density plots and Eisenhauer's relative dispersion coefficient. Methods: Hypothetical examples developed in R are used to create histograms and density plots and to compute the coefficient of variation and the relative dispersion coefficient. Results: The hypothetical examples show that both traditional approaches are flawed. Histograms are incapable of reflecting the distribution of probabilities. The coefficient of variation has issues when negative and positive values occur in the same dataset, is sensitive to outliers, and is severely affected by the mean of the distribution. Potential replacements are explained and applied for contrast. Conclusions: With modern computers and the R language, it is easy to replace histograms with density plots, which are able to approximate the theoretical probability distribution. In addition, Eisenhauer's relative dispersion coefficient is suggested as a suitable estimator of relative variability, including corrections for lower and upper bounds.
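Two of the coefficient of variation issues named in the abstract (dependence on the mean, and trouble with mixed-sign data) can be illustrated with a minimal sketch. This is not the authors' R code: it is a Python illustration using only the standard library, and the datasets `base`, `shifted`, and `centered` as well as the helper `coefficient_of_variation` are hypothetical, chosen only to make the effect visible.

```python
import statistics


def coefficient_of_variation(values):
    """Pearson's CV: sample standard deviation divided by the mean."""
    return statistics.stdev(values) / statistics.mean(values)


# Hypothetical data: three datasets with the same spread but different means.
base = [8.0, 9.0, 10.0, 11.0, 12.0]
shifted = [x + 90.0 for x in base]   # same spread, mean moved from 10 to 100
centered = [x - 9.9 for x in base]   # mixed signs, mean close to zero

for label, data in [("base", base), ("shifted", shifted), ("centered", centered)]:
    sd = statistics.stdev(data)
    cv = coefficient_of_variation(data)
    print(f"{label:9s} sd={sd:.4f} cv={cv:.4f}")
```

The standard deviation is the same in all three cases, yet the coefficient of variation shrinks when the mean grows and inflates when the mean approaches zero, which is the kind of behavior the abstract describes as being "severely affected by the mean".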

