Promise and Limitations of Supervised Optimal Transport-Based Graph Summarization via Information Theoretic Measures

05/11/2023
by   Sepideh Neshatfar, et al.
6

Graph summarization is the problem of producing smaller graph representations of an input graph dataset, in such a way that the smaller compressed graphs capture relevant structural information for downstream tasks. There is a recent graph summarization method that formulates an optimal transport-based framework that allows prior information about node, edge, and attribute importance (never defined in that work) to be incorporated into the graph summarization process. However, very little is known about the statistical properties of this framework. To elucidate this question, we consider the problem of supervised graph summarization, wherein by using information theoretic measures we seek to preserve relevant information about a class label. To gain a theoretical perspective on the supervised summarization problem itself, we first formulate it in terms of maximizing the Shannon mutual information between the summarized graph and the class label. We show an NP-hardness of approximation result for this problem, thereby constraining what one should expect from proposed solutions. We then propose a summarization method that incorporates mutual information estimates between random variables associated with sample graphs and class labels into the optimal transport compression framework. We empirically show performance improvements over previous works in terms of classification accuracy and time on synthetic and certain real datasets. We also theoretically explore the limitations of the optimal transport approach for the supervised summarization problem and we show that it fails to satisfy a certain desirable information monotonicity property.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/09/2020

COPT: Coordinated Optimal Transport on Graphs

We introduce COPT, a novel distance metric between graphs defined via an...
research
05/29/2019

Solving graph compression via optimal transport

We propose a new approach to graph compression by appeal to optimal tran...
research
09/05/2019

LSMI-Sinkhorn: Semi-supervised Squared-Loss Mutual Information Estimation with Optimal Transport

Estimating mutual information is an important machine learning and stati...
research
10/06/2022

InfoOT: Information Maximizing Optimal Transport

Optimal transport aligns samples across distributions by minimizing the ...
research
10/01/2021

Label Propagation Through Optimal Transport

In this paper, we tackle the transductive semi-supervised learning probl...
research
04/30/2020

Graph Summarization

The continuous and rapid growth of highly interconnected datasets, which...
research
12/24/2019

Unsupervised Learning of Graph Hierarchical Abstractions with Differentiable Coarsening and Optimal Transport

Hierarchical abstractions are a methodology for solving large-scale grap...

Please sign up or login with your details

Forgot password? Click here to reset