SILVAN: Estimating Betweenness Centralities with Progressive Sampling and Non-uniform Rademacher Bounds

06/07/2021
by   Leonardo Pellegrina, et al.
0

Betweenness centrality is a popular centrality measure with applications in several domains, and whose exact computation is impractical for modern-sized networks. We present SILVAN, a novel, efficient algorithm to compute, with high probability, accurate estimates of the betweenness centrality of all nodes of a graph and a high-quality approximation of the k most central nodes of a graph. SILVAN follows a progressive sampling approach, and builds on recently improved bounds on Monte-Carlo Empirical Rademacher Averages, a fundamental tool from statistical learning theory. SILVAN relies on a novel estimation scheme that leads to non-uniform bounds on the deviation of the estimates from the true values of the between centrality of all the nodes, providing tight guarantees on the quality of the approximation. Our extensive experimental evaluation shows that SILVAN extracts high-quality approximations while outperforming, in terms of number of samples and accuracy, the state-of-the-art approximation algorithm with comparable quality guarantees.

READ FULL TEXT
research
06/06/2023

Efficient Centrality Maximization with Rademacher Averages

The identification of the set of k most central nodes of a graph, or cen...
research
03/01/2022

ONBRA: Rigorous Estimation of the Temporal Betweenness Centrality in Temporal Networks

In network analysis, the betweenness centrality of a node informally cap...
research
04/17/2023

On approximating the temporal betweenness centrality through sampling

We present a collection of sampling-based algorithms for approximating t...
research
01/18/2021

PRESTO: Simple and Scalable Sampling Techniques for the Rigorous Approximation of Temporal Motif Counts

The identification and counting of small graph patterns, called network ...
research
03/03/2020

Scalable Distributed Approximation of Internal Measures for Clustering Evaluation

The most widely used internal measure for clustering evaluation is the s...
research
08/19/2021

odeN: Simultaneous Approximation of Multiple Motif Counts in Large Temporal Networks

Counting the number of occurrences of small connected subgraphs, called ...
research
12/18/2018

Uniform Convergence Bounds for Codec Selection

We frame the problem of selecting an optimal audio encoding scheme as a ...

Please sign up or login with your details

Forgot password? Click here to reset