Information-theoretic Limits for Testing Community Structures in Weighted Networks

04/19/2022
by   Mingao Yuan, et al.
0

Community detection refers to the problem of clustering the nodes of a network into groups. Existing inferential methods for community structure mainly focus on unweighted (binary) networks. Many real-world networks are nonetheless weighted and a common practice is to dichotomize a weighted network to an unweighted one which is known to result in information loss. Literature on hypothesis testing in the latter situation is still missing. In this paper, we study the problem of testing the existence of community structure in weighted networks. Our contributions are threefold: (a). We use the (possibly infinite-dimensional) exponential family to model the weights and derive the sharp information-theoretic limit for the existence of consistent test. Within the limit, any test is inconsistent; and beyond the limit, we propose a useful consistent test. (b). Based on the information-theoretic limits, we provide the first formal way to quantify the loss of information incurred by dichotomizing weighted graphs into unweighted graphs in the context of hypothesis testing. (c). We propose several new and practically useful test statistics. Simulation study show that the proposed tests have good performance. Finally, we apply the proposed tests to an animal social network.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/26/2023

Empirical likelihood test for community structure in networks

Network data, characterized by interconnected nodes and edges, is pervas...
research
01/29/2021

A Practical Two-Sample Test for Weighted Random Graphs

Network (graph) data analysis is a popular research topic in statistics ...
research
11/04/2021

Community detection in censored hypergraph

Community detection refers to the problem of clustering the nodes of a n...
research
07/20/2021

Limits of Detecting Extraterrestrial Civilizations

The search for extraterrestrial intelligence (SETI) is a scientific ende...
research
11/24/2020

A spectral-based framework for hypothesis testing in populations of networks

In this paper, we propose a new spectral-based approach to hypothesis te...
research
12/29/2020

Resolution limit revisited: community detection using generalized modularity density

Various attempts have been made in recent years to solve the Resolution ...
research
09/13/2016

Information Theoretic Structure Learning with Confidence

Information theoretic measures (e.g. the Kullback Liebler divergence and...

Please sign up or login with your details

Forgot password? Click here to reset