Graph Fourier MMD for Signals on Graphs

06/05/2023
by   Samuel Leone, et al.

While numerous methods have been proposed for computing distances between probability distributions in Euclidean space, relatively little attention has been given to computing such distances for distributions on graphs. However, there has been a marked increase in data that either lies on a graph (such as protein interaction networks) or can be modeled as a graph (such as single-cell data), particularly in the biomedical sciences. Thus, it becomes important to find ways to compare signals defined on such graphs. Here, we propose Graph Fourier MMD (GFMMD), a novel distance between distributions and signals on graphs. GFMMD is defined via an optimal witness function that is both smooth on the graph and maximizes the difference in expectation between the pair of distributions on the graph. We find an analytical solution to this optimization problem as well as an embedding of distributions that results from this method. We also prove several properties of this method, including scale invariance and applicability to disconnected graphs. We showcase it on graph benchmark datasets as well as on single-cell RNA-sequencing data analysis. In the latter, we use the GFMMD-based gene embeddings to find meaningful gene clusters. We also propose a novel type of score for gene selection, called the "gene localization score," which helps select genes for cellular state space characterization.
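
To make the witness-function idea concrete, here is a minimal sketch rather than the authors' implementation. The abstract does not state the exact smoothness constraint, so the sketch assumes it is f^T L f <= 1, with L the combinatorial graph Laplacian of a connected, undirected graph, and that both signals carry the same total mass. Under those assumptions, the Cauchy-Schwarz inequality gives the closed form sqrt((P - Q)^T L^+ (P - Q)), where L^+ is the Moore-Penrose pseudoinverse. The function names `graph_laplacian` and `gfmmd_sketch` are hypothetical.

```python
import numpy as np


def graph_laplacian(adjacency):
    """Combinatorial Laplacian L = D - A of an undirected graph."""
    A = np.asarray(adjacency, dtype=float)
    return np.diag(A.sum(axis=1)) - A


def gfmmd_sketch(adjacency, p, q):
    """Witness-function distance between two signals on a graph.

    Assumption (not taken from the abstract): the witness f is
    constrained by f^T L f <= 1. Returns the distance and a witness
    that attains it.
    """
    L = graph_laplacian(adjacency)
    d = np.asarray(p, dtype=float) - np.asarray(q, dtype=float)
    L_pinv = np.linalg.pinv(L)
    # Clip tiny negative values caused by floating-point round-off.
    value = float(np.sqrt(max(d @ L_pinv @ d, 0.0)))
    witness = L_pinv @ d
    if value > 0:
        witness = witness / value  # rescale so that f^T L f = 1
    return value, witness


# Path graph 0 - 1 - 2 with unit mass at opposite ends.
adjacency = np.array([[0, 1, 0],
                      [1, 0, 1],
                      [0, 1, 0]])
p = np.array([1.0, 0.0, 0.0])
q = np.array([0.0, 0.0, 1.0])
distance, witness = gfmmd_sketch(adjacency, p, q)
```

In this sketch the returned witness is smooth in the Laplacian sense and its difference in expectation between the two signals equals the returned distance. A dense pseudoinverse is only practical for small graphs; larger graphs would need a sparse solver or a spectral approximation.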


