NetMF+: Network Embedding Based on Fast and Effective Single-Pass Randomized Matrix Factorization

10/25/2021
by   Yuyang Xie, et al.
0

In this work, we propose NetMF+, a fast, memory-efficient, scalable, and effective network embedding algorithm developed for a single machine with CPU only. NetMF+ is based on the theoretically grounded embedding method NetMF and leverages the theories from randomized matrix factorization to learn embedding efficiently. We firstly propose a fast randomized eigen-decomposition algorithm for the modified Laplacian matrix. Then, sparse-sign randomized single-pass singular value decomposition (SVD) is utilized to avoid constructing dense matrix and generate promising embedding. To enhance the performance of embedding, we apply spectral propagation in NetMF+. Finally, A high-performance parallel graph processing stack GBBS is used to achieve memory-efficiency. Experiment results show that NetMF+ can learn a powerful embedding from a network with more than 10^11 edges within 1.5 hours at lower memory cost than state-of-the-art methods. The result on ClueWeb with 0.9 billion vertices and 75 billion edges shows that NetMF+ saves more than half of the memory and runtime than the state-of-the-art and has better performance. The source code of NetMF+ will be publicly available after the anonymous peer review.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/26/2019

NetSMF: Large-Scale Network Embedding as Sparse Matrix Factorization

We study the problem of large-scale network embedding, which aims to lea...
research
09/04/2020

Efficient Model-Based Collaborative Filtering with Fast Adaptive PCA

A model-based collaborative filtering (CF) approach utilizing fast adapt...
research
07/15/2019

Out-of-core singular value decomposition

Singular value decomposition (SVD) is a standard matrix factorization te...
research
10/05/2021

Revisiting SVD to generate powerful Node Embeddings for Recommendation Systems

Graph Representation Learning (GRL) is an upcoming and promising area in...
research
11/16/2020

RCHOL: Randomized Cholesky Factorization for Solving SDD Linear Systems

We introduce a randomized algorithm, namely RCHOL, to construct an appro...
research
04/19/2018

Programming Parallel Dense Matrix Factorizations with Look-Ahead and OpenMP

We investigate a parallelization strategy for dense matrix factorization...
research
10/12/2021

Deviance Matrix Factorization

We investigate a general matrix factorization for deviance-based losses,...

Please sign up or login with your details

Forgot password? Click here to reset