Lost in the Shuffle: Testing Power in the Presence of Errorful Network Vertex Labels

08/18/2022
by Ayushi Saxena, et al.

Many two-sample network hypothesis testing methodologies operate under the implicit assumption that the vertex correspondence across networks is known a priori. In this paper, we consider the degradation of power in two-sample graph hypothesis testing when some vertices are misaligned (label-shuffled) across networks. In the context of stochastic block model networks, we theoretically explore the power loss due to shuffling for a pair of hypothesis tests based on Frobenius norm differences, either between estimated edge probability matrices or between adjacency matrices. We further illustrate this loss in testing power through numerous simulations and experiments, in both the stochastic block model and the random dot product graph model, where we compare the power loss across multiple recently proposed tests in the literature. Lastly, we demonstrate the impact that shuffling can have in real-data testing in a pair of examples from neuroscience and from social network analysis.
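
As a rough illustration of the setting described above, the following sketch samples two networks from the same two-block stochastic block model, shuffles the labels of k vertices in the second network, and computes the squared Frobenius norm of the difference between the adjacency matrices. This is not the authors' code or testing procedure: the function names (sample_sbm, shuffle_labels, frobenius_sq), the block probabilities, the network size, and the way vertices are chosen for shuffling are all assumptions made for illustration only.

```python
import random


def sample_sbm(block_of, B, rng):
    """Sample a symmetric, hollow 0/1 adjacency matrix from a stochastic block model.

    block_of[i] is the block of vertex i; B[a][b] is the edge probability
    between blocks a and b (illustrative values, not taken from the paper).
    """
    n = len(block_of)
    A = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if rng.random() < B[block_of[i]][block_of[j]]:
                A[i][j] = A[j][i] = 1
    return A


def shuffle_labels(A, k, rng):
    """Return a copy of A in which k randomly chosen vertices are relabeled
    by a random permutation among themselves (hypothetical shuffling scheme)."""
    n = len(A)
    chosen = rng.sample(range(n), k)
    permuted = chosen[:]
    rng.shuffle(permuted)
    perm = list(range(n))
    for src, dst in zip(chosen, permuted):
        perm[src] = dst
    return [[A[perm[i]][perm[j]] for j in range(n)] for i in range(n)]


def frobenius_sq(A, B):
    """Squared Frobenius norm of the entrywise difference A - B."""
    return sum((a - b) ** 2 for ra, rb in zip(A, B) for a, b in zip(ra, rb))


if __name__ == "__main__":
    rng = random.Random(0)
    n = 60
    block_of = [0] * (n // 2) + [1] * (n // 2)  # two equal-sized blocks
    B = [[0.5, 0.1], [0.1, 0.5]]                # assumed block probabilities
    A1 = sample_sbm(block_of, B, rng)
    A2 = sample_sbm(block_of, B, rng)           # same model, so the null holds
    for k in (0, 10, 40):
        stat = frobenius_sq(A1, shuffle_labels(A2, k, rng))
        print(f"k={k:2d} shuffled vertices: squared Frobenius statistic = {stat}")
```

Running the script prints the statistic for several values of k. Because shuffling moves some vertices across blocks, the statistic can grow even though both networks come from the same model, which gives an intuition for why unknown shuffling complicates calibrating such a test.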


