Asymptotics of Network Embeddings Learned via Subsampling

07/06/2021
by   Andrew Davison, et al.
0

Network data are ubiquitous in modern machine learning, with tasks of interest including node classification, node clustering and link prediction. A frequent approach begins by learning an Euclidean embedding of the network, to which algorithms developed for vector-valued data are applied. For large networks, embeddings are learned using stochastic gradient methods where the sub-sampling scheme can be freely chosen. Despite the strong empirical performance of such methods, they are not well understood theoretically. Our work encapsulates representation methods using a subsampling approach, such as node2vec, into a single unifying framework. We prove, under the assumption that the graph is exchangeable, that the distribution of the learned embedding vectors asymptotically decouples. Moreover, we characterize the asymptotic distribution and provided rates of convergence, in terms of the latent parameters, which includes the choice of loss function and the embedding dimension. This provides a theoretical foundation to understand what the embedding vectors represent and how well these methods perform on downstream tasks. Notably, we observe that typically used loss functions may lead to shortcomings, such as a lack of Fisher consistency.

READ FULL TEXT
research
01/05/2022

Asymptotics of ℓ_2 Regularized Network Embeddings

A common approach to solving tasks, such as node classification or link ...
research
02/14/2021

Adversarial Attack on Network Embeddings via Supervised Network Poisoning

Learning low-level node embeddings using techniques from network represe...
research
02/10/2018

Low-Norm Graph Embedding

Learning distributed representations for nodes in graphs has become an i...
research
12/11/2020

Pair-view Unsupervised Graph Representation Learning

Low-dimension graph embeddings have proved extremely useful in various d...
research
07/10/2020

Next Waves in Veridical Network Embedding

Embedding nodes of a large network into a metric (e.g., Euclidean) space...
research
11/26/2019

Network Embedding: An Overview

Networks are one of the most powerful structures for modeling problems i...
research
05/19/2018

Conditional Network Embeddings

Network embeddings map the nodes of a given network into d-dimensional E...

Please sign up or login with your details

Forgot password? Click here to reset