Neighborhood Growth Determines Geometric Priors for Relational Representation Learning

10/12/2019
by   Melanie Weber, et al.
0

The problem of identifying geometric structure in heterogeneous, high-dimensional data is a cornerstone of representation learning. While there exists a large body of literature on the embeddability of canonical graphs, such as lattices or trees, the heterogeneity of the relational data typically encountered in practice limits the applicability of these classical methods. In this paper, we propose a combinatorial approach to evaluating embeddability, i.e., to decide whether a data set is best represented in Euclidean, Hyperbolic or Spherical space. Our method analyzes nearest-neighbor structures and local neighborhood growth rates to identify the geometric priors of suitable embedding spaces. For canonical graphs, the algorithm's prediction provably matches classical results. As for large, heterogeneous graphs, we introduce an efficiently computable statistic that approximates the algorithm's decision rule. We validate our method over a range of benchmark data sets and compare with recently published optimization-based embeddability methods.

READ FULL TEXT
research
06/25/2022

Geometry Contrastive Learning on Heterogeneous Graphs

Self-supervised learning (especially contrastive learning) methods on he...
research
05/28/2018

GESF: A Universal Discriminative Mapping Mechanism for Graph Representation Learning

Graph embedding is a central problem in social network analysis and many...
research
12/04/2021

Revisiting k-Nearest Neighbor Graph Construction on High-Dimensional Data : Experiments and Analyses

The k-nearest neighbor graph (KNNG) on high-dimensional data is a data s...
research
06/21/2023

Geometric Algorithms for k-NN Poisoning

We propose a label poisoning attack on geometric data sets against k-nea...
research
02/19/2019

Euclidean TSP, Motorcycle Graphs, and Other New Applications of Nearest-Neighbor Chains

We show new applications of the nearest-neighbor chain algorithm, a tech...
research
09/08/2021

Highly Scalable and Provably Accurate Classification in Poincare Balls

Many high-dimensional and large-volume data sets of practical relevance ...
research
03/06/2019

Fast Graph Representation Learning with PyTorch Geometric

We introduce PyTorch Geometric, a library for deep learning on irregular...

Please sign up or login with your details

Forgot password? Click here to reset