Heat diffusion distance processes: a statistically founded method to analyze graph data sets

09/27/2021
by Etienne Lasalle, et al.

We propose two multiscale comparisons of graphs based on heat diffusion, which make it possible to compare graphs without node correspondence, even when the graphs have different sizes. These multiscale comparisons lead to the definition of Lipschitz-continuous empirical processes indexed by a real parameter. We study the statistical properties of empirical means of such processes in the general case. Under mild assumptions, we prove a functional Central Limit Theorem, as well as a Gaussian approximation with a rate depending only on the sample size. Applied to our processes, these results make it possible to analyze data sets of pairs of graphs. Using bootstrap methods, we design consistent confidence bands around empirical means and consistent two-sample tests. Their performance is evaluated by simulations on synthetic data sets.
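To make the idea of a process indexed by a real parameter concrete, here is a minimal Python sketch. The abstract does not define the two comparisons, so this is not the paper's method: as a stand-in, the sketch compares two graphs through their normalized heat traces, (1/n) tr(exp(-tL)), where L is the combinatorial graph Laplacian and t is the diffusion time. This quantity needs no node correspondence and is defined for graphs of different sizes. The function names (`heat_trace`, `heat_trace_process`, `bootstrap_band`) are hypothetical, and the band uses a generic nonparametric bootstrap of the sup-norm deviation, which may differ from the bootstrap scheme analyzed in the paper.

```python
import numpy as np

def laplacian(adj):
    """Combinatorial Laplacian L = D - A of an undirected graph."""
    adj = np.asarray(adj, dtype=float)
    return np.diag(adj.sum(axis=1)) - adj

def heat_trace(adj, ts):
    """Normalized heat trace (1/n) * tr(exp(-t L)) for each t in ts.

    Assumption: this is an illustrative stand-in, not the paper's statistic.
    """
    eigvals = np.linalg.eigvalsh(laplacian(adj))
    # tr(exp(-t L)) is the sum of exp(-t * lambda_i) over the eigenvalues.
    return np.exp(-np.outer(ts, eigvals)).mean(axis=1)

def heat_trace_process(adj_a, adj_b, ts):
    """Curve t -> |heat_trace(A, t) - heat_trace(B, t)| for one pair of graphs."""
    return np.abs(heat_trace(adj_a, ts) - heat_trace(adj_b, ts))

def bootstrap_band(curves, level=0.95, n_boot=500, seed=0):
    """Uniform band around the empirical mean of a sample of curves.

    Assumption: plain nonparametric bootstrap of the sup-norm deviation
    between the resampled mean and the empirical mean.
    """
    rng = np.random.default_rng(seed)
    curves = np.asarray(curves, dtype=float)
    n = curves.shape[0]
    mean = curves.mean(axis=0)
    sups = np.empty(n_boot)
    for b in range(n_boot):
        idx = rng.integers(0, n, size=n)  # resample curves with replacement
        sups[b] = np.abs(curves[idx].mean(axis=0) - mean).max()
    half_width = np.quantile(sups, level)
    return mean - half_width, mean, mean + half_width
```

Given a data set of pairs of graphs, one would evaluate `heat_trace_process` on a common grid of diffusion times for each pair, stack the resulting curves into an array of shape (number of pairs, number of times), and pass that array to `bootstrap_band`.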


