Benford's Law Beyond Independence: Tracking Benford Behavior in Copula Models

12/31/2017
by   Rebecca F. Durst, et al.
0

Benford's law describes a common phenomenon among many naturally occurring data sets and distributions in which the leading digits of the data are distributed with the probability of a first digit of d base B being _Bd+1/d. As it often successfully detects fraud in medical trials, voting, science and finance, significant effort has been made to understand when and how distributions exhibit Benford behavior. Most of the previous work has been restricted to cases of independent variables, and little is known about situations involving dependence. We use copulas to investigate the Benford behavior of the product of n dependent random variables. We develop a method for approximating the Benford behavior of a product of n dependent random variables modeled by a copula distribution C and quantify and bound a copula distribution's distance from Benford behavior. We then investigate the Benford behavior of various copulas under varying dependence parameters and number of marginals. Our investigations show that the convergence to Benford behavior seen with independent random variables as the number of variables in the product increases is not necessarily preserved when the variables are dependent and modeled by a copula. Furthermore, there is strong indication that the preservation of Benford behavior of the product of dependent random variables may be linked more to the structure of the copula than to the Benford behavior of the marginal distributions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/27/2018

A Kolmogorov-Smirnov type test for two inter-dependent random variables

Consider n iid random variables, where ξ_1, ..., ξ_n are n realisations ...
research
11/05/2021

Why the 1-Wasserstein distance is the area between the two marginal CDFs

We elucidate why the 1-Wasserstein distance W_1 coincides with the area ...
research
12/13/2017

Limit theorems for the Multiplicative Binomial Distribution (MBD)

The sum of n non-independent Bernoulli random variables could be modeled...
research
03/05/2020

A strong law of large numbers for simultaneously testing parameters of Lancaster bivariate distributions

We prove a strong law of large numbers for simultaneously testing parame...
research
03/03/2018

An extension of Azzalini's method

The aim of this paper is to extend Azzalini's method. This extension is ...
research
06/27/2022

Robustness Implies Generalization via Data-Dependent Generalization Bounds

This paper proves that robustness implies generalization via data-depend...

Please sign up or login with your details

Forgot password? Click here to reset