The Four Point Permutation Test for Latent Block Structure in Incidence Matrices

10/04/2018
by   R W R Darling, et al.
0

Transactional data may be represented as a bipartite graph G:=(L ∪ R, E), where L denotes agents, R denotes objects visible to many agents, and an edge in E denotes an interaction between an agent and an object. Unsupervised learning seeks to detect block structures in the adjacency matrix Z between L and R, thus grouping together sets of agents with similar object interactions. New results on quasirandom permutations suggest a non-parametric four point test to measure the amount of block structure in G, with respect to vertex orderings on L and R. Take disjoint 4-edge random samples, order these four edges by left endpoint, and count the relative frequencies of the 4! possible orderings of the right endpoint. When these orderings are equiprobable, the edge set E corresponds to a quasirandom permutation π of |E| symbols. Total variation distance of the relative frequency vector away from the uniform distribution on 24 permutations measures the amount of block structure. Such a test statistic, based on |E|/4 samples, is computable in O(|E|/p) time on p processors. Possibly block structure may be enhanced by precomputing natural orders on L and R, related to the second eigenvector of graph Laplacians. In practice this takes O(d |E|) time, where d is the graph diameter. Five open problems are described.

READ FULL TEXT
research
03/26/2020

A Blind Permutation Similarity Algorithm

This paper introduces a polynomial blind algorithm that determines when ...
research
03/26/2021

Beyond the adjacency matrix: random line graphs and inference for networks with edge attributes

Any modern network inference paradigm must incorporate multiple aspects ...
research
10/22/2020

Vertex deletion into bipartite permutation graphs

A permutation graph can be defined as an intersection graph of segments ...
research
09/06/2020

Vertex-Domatic, Edge-Domatic and Total Domatic Number of Uniform Hypergraphs

E. J. Cockayne and S. T. Hedetniemi introduced the concept of domatic nu...
research
11/16/2022

Two-Sample Test for Stochastic Block Models via the Largest Singular Value

The stochastic block model is widely used for detecting community struct...
research
12/08/2021

Consistency of Spectral Seriation

Consider a random graph G of size N constructed according to a graphon w...
research
11/28/2007

Representation and Measure of Structural Information

We introduce a uniform representation of general objects that captures t...

Please sign up or login with your details

Forgot password? Click here to reset