A metric on directed graphs and Markov chains based on hitting probabilities

06/25/2020
by Zachary M. Boyd, et al.

The shortest-path, commute-time, and diffusion distances on undirected graphs have been widely employed in applications such as dimensionality reduction, link prediction, and trip planning. There is growing interest in exploiting the asymmetric structure of data derived from Markov chains and directed graphs, but few metrics are specifically adapted to this task. We introduce a metric on the state space of any ergodic, finite-state, time-homogeneous Markov chain and, in particular, on any Markov chain derived from a directed graph. Our construction is based on hitting probabilities, with nearness in the metric space related to the transfer of random walkers from one node to another at stationarity. Notably, our metric is insensitive to shortest and average path distances, and so provides new information compared to existing metrics. We use possible degeneracies in the metric to develop an interesting structural theory of directed graphs and explore a related quotienting procedure. Our metric can be computed in O(n^3) time, where n is the number of states, and in examples we scale up to n=10,000 nodes and ≈ 38M edges on a desktop computer. In several examples, we explore the nature of the metric, compare it to alternative methods, and demonstrate its utility for weak recovery of community structure in dense graphs, visualization, structure recovery, dynamics exploration, and multiscale cluster detection.
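
To make the idea of a hitting-probability-based distance concrete, here is a minimal Python sketch for a small chain. The abstract does not state the construction, so everything below is an assumption made for illustration: `Q[i, j]` is taken to be the probability that a walker started at `i` reaches `j` before returning to `i`, and the distance is taken to be `-log(sqrt(pi[i] / pi[j]) * Q[i, j])`, where `pi` is the stationary distribution. The function names and the example transition matrix `P` are hypothetical. The sketch solves one linear system per pair of states, so it is a brute-force illustration and not the O(n^3) procedure the abstract mentions.

```python
import numpy as np


def stationary_distribution(P):
    """Solve pi @ P = pi with sum(pi) = 1 for an ergodic chain."""
    n = P.shape[0]
    # Stack the balance equations with the normalization constraint.
    A = np.vstack([P.T - np.eye(n), np.ones((1, n))])
    b = np.zeros(n + 1)
    b[-1] = 1.0
    pi, *_ = np.linalg.lstsq(A, b, rcond=None)
    return pi


def hitting_probabilities(P):
    """Q[i, j] = P(start at i, reach j before returning to i) (assumed definition)."""
    n = P.shape[0]
    Q = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            # u[x] = P(start at x, hit j before i); boundary values u[j] = 1, u[i] = 0.
            u = np.zeros(n)
            u[j] = 1.0
            free = [x for x in range(n) if x != i and x != j]
            if free:
                # Harmonic equations on the remaining states.
                A = np.eye(len(free)) - P[np.ix_(free, free)]
                u[free] = np.linalg.solve(A, P[free, j])
            # Take one step from i, then ask whether j is hit before i.
            Q[i, j] = P[i] @ u
    return Q


def hitting_probability_distance(P):
    """Assumed distance: -log of the stationarity-normalized hitting probability."""
    pi = stationary_distribution(P)
    Q = hitting_probabilities(P)
    A = np.sqrt(pi[:, None] / pi[None, :]) * Q
    np.fill_diagonal(A, 1.0)  # so that the distance from a state to itself is 0
    return -np.log(A)


# Hypothetical 3-state ergodic, non-reversible chain (rows sum to 1).
P = np.array([
    [0.1, 0.6, 0.3],
    [0.4, 0.2, 0.4],
    [0.5, 0.3, 0.2],
])
pi = stationary_distribution(P)
Q = hitting_probabilities(P)
D = hitting_probability_distance(P)
```

Although `Q` itself is generally asymmetric, the product `pi[i] * Q[i, j]` equals `pi[j] * Q[j, i]` for any ergodic chain, which is why weighting by the stationary distribution yields a symmetric quantity here even when `P` is not reversible.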

research
02/25/2022

Kron Reduction and Effective Resistance of Directed Graphs

In network theory, the concept of the effective resistance is a distance...
research
11/25/2021

On the Strong Metric Dimension of directed co-graphs

Let G be a strongly connected directed graph and u,v,w∈ V(G) be three ve...
research
06/13/2018

Path-entropy maximized Markov chains for dimensionality reduction

Stochastic kernel based dimensionality reduction methods have become pop...
research
11/04/2019

Multilateration of Random Networks with Community Structure

The minimal number of nodes required to multilaterate a network endowed ...
research
02/16/2023

Distances for Markov Chains, and Their Differentiation

(Directed) graphs with node attributes are a common type of data in vari...
research
03/30/2023

Data-driven abstractions via adaptive refinements and a Kantorovich metric [extended version]

We introduce an adaptive refinement procedure for smart, and scalable ab...
research
10/11/2018

Applications of PageRank to Function Comparison and Malware Classification

We classify .NET files as either benign or malicious by examining certai...
