Online Factorization and Partition of Complex Networks From Random Walks

05/22/2017
by   Lin F. Yang, et al.
0

Finding the reduced-dimensional structure is critical to understanding complex networks. Existing approaches such as spectral clustering are applicable only when the full network is explicitly observed. In this paper, we focus on the online factorization and partition of implicit large-scale networks based on observations from an associated random walk. We formulate this into a nonconvex stochastic factorization problem and propose an efficient and scalable stochastic generalized Hebbian algorithm. The algorithm is able to process dependent state-transition data dynamically generated by the underlying network and learn a low-dimensional representation for each vertex. By applying a diffusion approximation analysis, we show that the continuous-time limiting process of the stochastic algorithm converges globally to the "principal components" of the Markov chain and achieves a nearly optimal sample complexity. Once given the learned low-dimensional representations, we further apply clustering techniques to recover the network partition. We show that when the associated Markov process is lumpable, one can recover the partition exactly with high probability. We apply the proposed approach to model the traffic flow of Manhattan as city-wide random walks. By using our algorithm to analyze the taxi trip data, we discover a latent partition of the Manhattan city that closely matches the traffic dynamics.

READ FULL TEXT
research
10/01/2022

Clustering for directed graphs using parametrized random walk diffusion kernels

Clustering based on the random walk operator has been proven effective f...
research
09/27/2019

Identifying Low-Dimensional Structures in Markov Chains: A Nonnegative Matrix Factorization Approach

A variety of queries about stochastic systems boil down to study of Mark...
research
11/04/2021

Community detection using low-dimensional network embedding algorithms

With the increasing relevance of large networks in important areas such ...
research
03/11/2013

Spectral Clustering with Epidemic Diffusion

Spectral clustering is widely used to partition graphs into distinct mod...
research
08/25/2023

Network Embedding Using Sparse Approximations of Random Walks

In this paper, we propose an efficient numerical implementation of Netwo...
research
07/19/2022

Clustering constrained on linear networks

An unsupervised classification method for point events occurring on a ne...
research
07/25/2022

Orthogonalization of data via Gromov-Wasserstein type feedback for clustering and visualization

In this paper we propose an adaptive approach for clustering and visuali...

Please sign up or login with your details

Forgot password? Click here to reset