UniNet: Scalable Network Representation Learning with Metropolis-Hastings Sampling

10/10/2020
by   Xingyu Yao, et al.
5

Network representation learning (NRL) technique has been successfully adopted in various data mining and machine learning applications. Random walk based NRL is one popular paradigm, which uses a set of random walks to capture the network structural information, and then employs word2vec models to learn the low-dimensional representations. However, until now there is lack of a framework, which unifies existing random walk based NRL models and supports to efficiently learn from large networks. The main obstacle comes from the diverse random walk models and the inefficient sampling method for the random walk generation. In this paper, we first introduce a new and efficient edge sampler based on Metropolis-Hastings sampling technique, and theoretically show the convergence property of the edge sampler to arbitrary discrete probability distributions. Then we propose a random walk model abstraction, in which users can easily define different transition probability by specifying dynamic edge weights and random walk states. The abstraction is efficiently supported by our edge sampler, since our sampler can draw samples from unnormalized probability distribution in constant time complexity. Finally, with the new edge sampler and random walk model abstraction, we carefully implement a scalable NRL framework called UniNet. We conduct comprehensive experiments with five random walk based NRL models over eleven real-world datasets, and the results clearly demonstrate the efficiency of UniNet over billion-edge networks.

READ FULL TEXT
research
10/07/2021

Graph sampling by lagged random walk

We propose a family of lagged random walk sampling methods in simple und...
research
05/23/2019

The Skipping Sampler: A new approach to sample from complex conditional densities

We introduce the Skipping Sampler, a novel algorithm to efficiently samp...
research
05/09/2018

Diffusion Based Network Embedding

In network embedding, random walks play a fundamental role in preserving...
research
10/31/2022

Random Walk-based Community Key-members Search over Large Graphs

Given a graph G, a query node q, and an integer k, community search (CS)...
research
10/26/2019

Efficient Metropolis-Hastings Sampling for Nonlinear Mixed Effects Models

The ability to generate samples of the random effects from their conditi...
research
11/18/2019

RWNE: A Scalable Random-Walk based Network Embedding Framework with Personalized Higher-order Proximity Preserved

Higher-order proximity preserved network embedding has attracted increas...
research
07/26/2021

ThunderRW: An In-Memory Graph Random Walk Engine (Complete Version)

As random walk is a powerful tool in many graph processing, mining and l...

Please sign up or login with your details

Forgot password? Click here to reset