Embedding Signals on Knowledge Graphs with Unbalanced Diffusion Earth Mover's Distance

07/26/2021
by   Alexander Tong, et al.
0

In modern relational machine learning it is common to encounter large graphs that arise via interactions or similarities between observations in many domains. Further, in many cases the target entities for analysis are actually signals on such graphs. We propose to compare and organize such datasets of graph signals by using an earth mover's distance (EMD) with a geodesic cost over the underlying graph. Typically, EMD is computed by optimizing over the cost of transporting one probability distribution to another over an underlying metric space. However, this is inefficient when computing the EMD between many signals. Here, we propose an unbalanced graph earth mover's distance that efficiently embeds the unbalanced EMD on an underlying graph into an L^1 space, whose metric we call unbalanced diffusion earth mover's distance (UDEMD). This leads us to an efficient nearest neighbors kernel over many signals defined on a large graph. Next, we show how this gives distances between graph signals that are robust to noise. Finally, we apply this to organizing patients based on clinical notes who are modelled as signals on the SNOMED-CT medical knowledge graph, embedding lymphoblast cells modeled as signals on a gene graph, and organizing genes modeled as signals over a large peripheral blood mononuclear (PBMC) cell graph. In each case, we show that UDEMD-based embeddings find accurate distances that are highly efficient compared to other methods.

READ FULL TEXT
research
02/25/2021

Diffusion Earth Mover's Distance and Distribution Embeddings

We propose a new fast method of measuring distances between large number...
research
06/05/2023

Graph Fourier MMD for Signals on Graphs

While numerous methods have been proposed for computing distances betwee...
research
07/03/2017

People Mover's Distance: Class level geometry using fast pairwise data adaptive transportation costs

We address the problem of defining a network graph on a large collection...
research
09/09/2019

Distance-Preserving Graph Embeddings from Random Neural Features

We present Graph Random Neural Features (GRNF), a novel embedding method...
research
09/09/2021

FGOT: Graph Distances based on Filters and Optimal Transport

Graph comparison deals with identifying similarities and dissimilarities...
research
07/22/2019

Orometric Methods in Bounded Metric Data

A large amount of data accommodated in knowledge graphs (KG) is actually...
research
06/29/2021

Diff2Dist: Learning Spectrally Distinct Edge Functions, with Applications to Cell Morphology Analysis

We present a method for learning "spectrally descriptive" edge weights f...

Please sign up or login with your details

Forgot password? Click here to reset