Universal Approximation of Edge Density in Large Graphs

08/06/2015
by   Marc Boullé, et al.
0

In this paper, we present a novel way to summarize the structure of large graphs, based on non-parametric estimation of edge density in directed multigraphs. Following coclustering approach, we use a clustering of the vertices, with a piecewise constant estimation of the density of the edges across the clusters, and address the problem of automatically and reliably inferring the number of clusters, which is the granularity of the coclustering. We use a model selection technique with data-dependent prior and obtain an exact evaluation criterion for the posterior probability of edge density estimation models. We demonstrate, both theoretically and empirically, that our data-dependent modeling technique is consistent, resilient to noise, valid non asymptotically and asymptotically behaves as an universal approximator of the true edge density in directed multigraphs. We evaluate our method using artificial graphs and present its practical interest on real world graphs. The method is both robust and scalable. It is able to extract insightful patterns in the unsupervised learning setting and to provide state of the art accuracy when used as a preparation step for supervised learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/23/2022

Clustering Edges in Directed Graphs

How do vertices exert influence in graph data? We develop a framework fo...
research
10/07/2016

Combining local and global smoothing in multivariate density estimation

Non-parametric estimation of a multivariate density estimation is tackle...
research
07/27/2018

Deep PDF: Probabilistic Surface Optimization and Density Estimation

A probability density function (pdf) encodes the entire stochastic knowl...
research
12/07/2015

Clustering by Deep Nearest Neighbor Descent (D-NND): A Density-based Parameter-Insensitive Clustering Method

Most density-based clustering methods largely rely on how well the under...
research
08/03/2022

Common Pairs of Graphs

A graph H is said to be common if the number of monochromatic labelled c...
research
07/22/2019

Direction Matters: On Influence-Preserving Graph Summarization and Max-cut Principle for Directed Graphs

Summarizing large-scaled directed graphs into small-scale representation...
research
04/10/2015

3D Density Histograms for Criteria-driven Edge Bundling

This paper presents a graph bundling algorithm that agglomerates edges t...

Please sign up or login with your details

Forgot password? Click here to reset