A bag-of-paths framework for network data analysis

02/27/2013
by   Kevin Françoisse, et al.
0

This work develops a generic framework, called the bag-of-paths (BoP), for link and network data analysis. The central idea is to assign a probability distribution on the set of all paths in a network. More precisely, a Gibbs-Boltzmann distribution is defined over a bag of paths in a network, that is, on a representation that considers all paths independently. We show that, under this distribution, the probability of drawing a path connecting two nodes can easily be computed in closed form by simple matrix inversion. This probability captures a notion of relatedness between nodes of the graph: two nodes are considered as highly related when they are connected by many, preferably low-cost, paths. As an application, two families of distances between nodes are derived from the BoP probabilities. Interestingly, the second distance family interpolates between the shortest path distance and the resistance distance. In addition, it extends the Bellman-Ford formula for computing the shortest path distance in order to integrate sub-optimal paths by simply replacing the minimum operator by the soft minimum operator. Experimental results on semi-supervised classification show that both of the new distance families are competitive with other state-of-the-art approaches. In addition to the distance measures studied in this paper, the bag-of-paths framework enables straightforward computation of many other relevant network measures.

READ FULL TEXT
research
06/07/2018

Randomized Optimal Transport on a Graph: Framework and New Distance Measures

The recently developed bag-of-paths framework consists in setting a Gibb...
research
10/16/2012

Semi-Supervised Classification Through the Bag-of-Paths Group Betweenness

This paper introduces a novel, well-founded, betweenness measure, called...
research
02/08/2019

Covariance and Correlation Kernels on a Graph in the Generalized Bag-of-Paths Formalism

This work derives closed-form expressions computing the expectation of c...
research
12/15/2021

Maximum likelihood estimation for randomized shortest paths with trajectory data

Randomized shortest paths (RSP) are a tool developed in recent years for...
research
11/03/2021

Geodesic statistics for random network families

A key task in the study of networked systems is to derive local and glob...
research
09/15/2017

On the stability of network indices defined by means of matrix functions

Identifying important components in a network is one of the major goals ...
research
08/23/2021

Relative Entropy-Regularized Optimal Transport on a Graph: a new algorithm and an experimental comparison

Following [21, 23], the present work investigates a new relative entropy...

Please sign up or login with your details

Forgot password? Click here to reset