Fast computation of p-values for the permutation test based on Pearson's correlation coefficient and other statistical tests

07/26/2018
by   Jean-Marie Droz, et al.
0

Permutation tests are among the simplest and most widely used statistical tools. Their p-values can be computed by a straightforward sampling of permutations. However, this way of computing p-values is often so slow that it is replaced by an approximation, which is accurate only for part of the interesting range of parameters. Moreover, the accuracy of the approximation can usually not be improved by increasing the computation time. We introduce a new sampling-based algorithm which uses the fast Fourier transform to compute p-values for the permutation test based on Pearson's correlation coefficient. The algorithm is practically and asymptotically faster than straightforward sampling. Typically, its complexity is logarithmic in the input size, while the complexity of straightforward sampling is linear. The idea behind the algorithm can also be used to accelerate the computation of p-values for many other common statistical tests. The algorithm is easy to implement, but its analysis involves results from the representation theory of the symmetric group.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/28/2018

Extremely efficient permutation and bootstrap hypothesis tests using R

Re-sampling based statistical tests are known to be computationally heav...
research
09/07/2021

A note on the permutation distribution of generalized correlation coefficients

We provide sufficient conditions for the asymptotic normality of the gen...
research
12/05/2019

Another look at the Lady Tasting Tea and permutation-based randomization tests

Fisher's famous Lady Tasting Tea experiment is often referred to as the ...
research
12/23/2019

Study on upper limit of sample sizes for a two-level test in NIST SP800-22

NIST SP800-22 is one of the widely used statistical testing tools for ps...
research
04/29/2018

Distribution-Free, Size Adaptive Submatrix Detection with Acceleration

Given a large matrix containing independent data entries, we consider th...
research
07/16/2014

Large scale canonical correlation analysis with iterative least squares

Canonical Correlation Analysis (CCA) is a widely used statistical tool w...
research
07/26/2017

A Note on Implementing a Special Case of the LEAR Covariance Model in Standard Software

Repeated measures analyses require proper choice of the correlation mode...

Please sign up or login with your details

Forgot password? Click here to reset