Consistency of random-walk based network embedding algorithms

01/18/2021
by   Yichi Zhang, et al.
0

Random-walk based network embedding algorithms like node2vec and DeepWalk are widely used to obtain Euclidean representation of the nodes in a network prior to performing down-stream network inference tasks. Nevertheless, despite their impressive empirical performance, there is a lack of theoretical results explaining their behavior. In this paper we studied the node2vec and DeepWalk algorithms through the perspective of matrix factorization. We analyze these algorithms in the setting of community detection for stochastic blockmodel graphs; in particular we established large-sample error bounds and prove consistent community recovery of node2vec/DeepWalk embedding followed by k-means clustering. Our theoretical results indicate a subtle interplay between the sparsity of the observed networks, the window sizes of the random walks, and the convergence rates of the node2vec/DeepWalk embedding toward the embedding of the true but unknown edge probabilities matrix. More specifically, as the network becomes sparser, our results suggest using larger window sizes, or equivalently, taking longer random walks, in order to attain better convergence rate for the resulting embeddings. The paper includes numerical experiments corroborating these observations.

READ FULL TEXT
research
11/04/2021

Community detection using low-dimensional network embedding algorithms

With the increasing relevance of large networks in important areas such ...
research
07/04/2023

Random Walk on Multiple Networks

Random Walk is a basic algorithm to explore the structure of networks, w...
research
01/21/2021

Synwalk – Community Detection via Random Walk Modelling

Complex systems, abstractly represented as networks, are ubiquitous in e...
research
07/21/2021

Delving Into Deep Walkers: A Convergence Analysis of Random-Walk-Based Vertex Embeddings

Graph vertex embeddings based on random walks have become increasingly i...
research
09/09/2021

Ergodic Limits, Relaxations, and Geometric Properties of Random Walk Node Embeddings

Random walk based node embedding algorithms learn vector representations...
research
10/02/2019

A new method for quantifying network cyclic structure to improve community detection

A distinguishing property of communities in networks is that cycles are ...
research
11/27/2018

Knots in random neural networks

The weights of a neural network are typically initialized at random, and...

Please sign up or login with your details

Forgot password? Click here to reset