Estimation of circular statistics in the presence of measurement bias

09/21/2022
by   Abdallah Alsammani, et al.
0

Background and objective. Circular statistics and Rayleigh tests are important tools for analyzing the occurrence of cyclic events. However, current methods fail in the presence of measurement bias, such as incomplete or otherwise non-uniform sampling. Consider, for example, studying 24-cyclicity but having data not recorded uniformly over the full 24-hour cycle. The objective of this paper is to present a method to estimate circular statistics and their statistical significance even in this circumstance. Methods. We present our objective as a special case of a more general problem: estimating probability distributions in the context of imperfect measurements, a highly studied problem in high energy physics. Our solution combines 1) existing approaches that estimate the measurement process via numeric simulation and 2) innovative use of linear parametrizations of the underlying distributions. We compute the estimation error for several toy examples as well as a real-world example: analyzing the 24-hour cyclicity of an electrographic biomarker of epileptic tissue controlled for state of vigilance. Results. Our method shows low estimation error. In a real-world example, we observed the corrected moments had a root mean square residual less than 0.007. We additionally found that, even with unfolding, Rayleigh test statistics still often underestimate the p-values (and thus overestimate statistical significance) in the presence of non-uniform sampling. Numerical estimation of statistical significance, as described herein, is thus preferable. Conclusions. The presented methods provide a robust solution to addressing incomplete or otherwise non-uniform sampling. The general method presented is also applicable to a wider set of analyses involving estimation of the true probability distribution adjusted for imperfect measurement processes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/20/2020

Improving Sampling Accuracy of Stochastic Gradient MCMC Methods via Non-uniform Subsampling of Gradients

Common Stochastic Gradient MCMC methods approximate gradients by stochas...
research
09/13/2019

Order statistics on the spacings between order statistics for the uniform distribution

Closed-form expressions for the distributions of the order statistics on...
research
07/11/2019

Method of moments for 3-D single particle ab initio modeling with non-uniform distribution of viewing angles

Single-particle reconstruction in cryo-electron microscopy (cryo-EM) is ...
research
12/12/2019

Statistical Analysis of the Phosphate Data of the World Ocean Database 2013

The phosphate data of the World Ocean Database 2013 are extensively stat...
research
07/29/2020

Non-Uniform Sampling of Fixed Margin Uniform Matrices

Data sets in the form of binary matrices are ubiquitous across scientifi...
research
08/24/2020

On sampling from data with duplicate records

Data deduplication is the task of detecting records in a database that c...
research
11/01/2017

Geostatistical inference in the presence of geomasking: a composite-likelihood approach

In almost any geostatistical analysis, one of the underlying, often impl...

Please sign up or login with your details

Forgot password? Click here to reset