Interpretable Network Representation Learning with Principal Component Analysis

06/27/2021
by   James D. Wilson, et al.
We consider the problem of interpretable network representation learning for samples of network-valued data. We propose the Principal Component Analysis for Networks (PCAN) algorithm to identify statistically meaningful low-dimensional representations of a network sample via subgraph count statistics. The PCAN procedure provides an interpretable framework in which one can readily visualize, explore, and formulate predictive models for network samples. We furthermore introduce a fast sampling-based algorithm, sPCAN, which is significantly more computationally efficient than PCAN while retaining its interpretability. We investigate the relationship between these two methods and analyze their large-sample properties under the common regime where the sample of networks is a collection of kernel-based random graphs. We show that under this regime, the embeddings of the sPCAN method enjoy a central limit theorem and, moreover, that the population-level embeddings of PCAN and sPCAN are equivalent. We assess PCAN's ability to visualize, cluster, and classify observations in network samples arising in nature, including functional connectivity network samples and dynamic networks describing the political co-voting habits of the U.S. Senate. Our analyses reveal that the proposed algorithm provides informative and discriminatory features describing the networks in each sample. The PCAN and sPCAN methods build on the current literature of network representation learning and set the stage for a new line of research in interpretable learning on network-valued data. Software for the PCAN and sPCAN methods is publicly available at https://www.github.com/jihuilee/.
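To make the idea of embedding a network sample through subgraph count statistics concrete, the following is a minimal sketch and not the authors' implementation. It assumes undirected simple graphs given as adjacency matrices, and it assumes edges, 2-stars, and triangles as the counted subgraphs; the abstract does not say which subgraphs PCAN uses. The helper names `subgraph_counts`, `pca_embed`, and `random_graph` are hypothetical, and the toy sample of random graphs is generated only for illustration.

```python
import numpy as np


def subgraph_counts(A):
    """Count edges, 2-stars, and triangles in an undirected simple graph.

    The choice of these three subgraphs is an assumption for illustration.
    """
    A = np.asarray(A, dtype=float)
    deg = A.sum(axis=1)
    edges = deg.sum() / 2.0                      # each edge is counted twice in the degree sum
    two_stars = (deg * (deg - 1) / 2.0).sum()    # pairs of edges sharing a center node
    triangles = np.trace(A @ A @ A) / 6.0        # each triangle yields 6 closed 3-walks
    return np.array([edges, two_stars, triangles])


def pca_embed(X, k=2):
    """Standardize the columns of X and project onto the top-k principal components."""
    Xc = X - X.mean(axis=0)
    scale = Xc.std(axis=0, ddof=1)
    scale[scale == 0] = 1.0                      # leave constant columns unscaled
    Z = Xc / scale
    _, _, Vt = np.linalg.svd(Z, full_matrices=False)
    loadings = Vt[:k]                            # one row per component, one column per subgraph
    return Z @ loadings.T, loadings


def random_graph(n, p, rng):
    """Toy undirected graph with independent edges (illustrative data only)."""
    upper = np.triu(rng.random((n, n)) < p, k=1)
    return (upper | upper.T).astype(float)


rng = np.random.default_rng(0)
# A toy sample: 20 sparse and 20 denser graphs on 30 nodes each.
graphs = [random_graph(30, 0.1, rng) for _ in range(20)]
graphs += [random_graph(30, 0.3, rng) for _ in range(20)]

# One row of subgraph counts per network, then ordinary PCA on that matrix.
X = np.vstack([subgraph_counts(A) for A in graphs])
scores, loadings = pca_embed(X, k=2)
```

Each row of `scores` is a two-dimensional embedding of one network, and each row of `loadings` weights the named subgraph counts, which is what makes a representation of this kind readable: a component can be described directly in terms of edges, 2-stars, and triangles.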
