Non-Uniform Sampling of Fixed Margin Binary Matrices

07/29/2020
by Alex Fout, et al.

Data sets in the form of binary matrices are ubiquitous across scientific domains, and researchers are often interested in identifying and quantifying noteworthy structure. One approach is to compare the observed data to that which might be obtained under a null model. Here we consider sampling from the space of binary matrices which satisfy a set of marginal row and column sums. Whereas existing sampling methods have focused on uniform sampling from this space, we introduce modified versions of two elementwise swapping algorithms which sample according to a non-uniform probability distribution defined by a weight matrix, which gives the relative probability of a one for each entry. We demonstrate that values of zero in the weight matrix, i.e. structural zeros, are generally problematic for swapping algorithms, except when they have special monotonic structure. We explore the properties of our algorithms through simulation studies, and illustrate the potential impact of employing a non-uniform null model using a classic bird habitation dataset.
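The idea of a weighted elementwise swap can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the target probability of a matrix is proportional to the product of the weights at its one-entries, and it uses a generic Metropolis acceptance rule. The function name `weighted_swap_step` and the example matrices are hypothetical.

```python
import random

def weighted_swap_step(A, W, rng):
    """Attempt one weighted checkerboard swap on A in place.

    A is a list of lists of 0/1 entries, W holds non-negative weights of the
    same shape, and rng is a random.Random instance. Returns True if A changed.
    Assumption: P(A) is proportional to the product of W[i][j] over ones in A.
    """
    # Pick two distinct rows and two distinct columns uniformly at random.
    i, j = rng.sample(range(len(A)), 2)
    k, l = rng.sample(range(len(A[0])), 2)
    a, b, c, d = A[i][k], A[i][l], A[j][k], A[j][l]

    # A swap keeps the margins only on a checkerboard submatrix:
    # [[1, 0], [0, 1]] or [[0, 1], [1, 0]].
    if not (a == d and b == c and a != b):
        return False

    diag = W[i][k] * W[j][l]   # weight when the ones sit on the diagonal
    anti = W[i][l] * W[j][k]   # weight when the ones sit on the anti-diagonal
    current, proposed = (diag, anti) if a == 1 else (anti, diag)

    # Metropolis rule: accept with probability min(1, proposed / current).
    # A proposal that places a one on a zero-weight entry is never accepted.
    if rng.random() * current < proposed:
        A[i][k], A[i][l], A[j][k], A[j][l] = b, a, d, c
        return True
    return False


rng = random.Random(0)
A = [[1, 0, 1],
     [0, 1, 0],
     [1, 1, 0]]
W = [[1.0, 2.0, 0.5],
     [0.5, 1.0, 2.0],
     [2.0, 0.5, 1.0]]
for _ in range(1000):
    weighted_swap_step(A, W, rng)
```

Every accepted swap flips a 2x2 checkerboard, so the row and column sums of `A` never change. With all weights equal the acceptance probability is always one and the step reduces to an ordinary uniform swap. A zero weight blocks every swap that would move a one onto that entry, which hints at why structural zeros can cut a swapping chain off from parts of the space.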
