Graphlet Count Estimation via Convolutional Neural Networks

10/07/2018
by   Xutong Liu, et al.
0

Graphlets are defined as k-node connected induced subgraph patterns. For an undirected graph, 3-node graphlets include close triangle and open triangle. When k = 4, there are six types of graphlets, e.g., tailed-triangle and clique are two possible 4-node graphlets. The number of each graphlet, called graphlet count, is a signature which characterizes the local network structure of a given graph. Graphlet count plays a prominent role in network analysis of many fields, most notably bioinformatics and social science. However, computing exact graphlet count is inherently difficult and computational expensive because the number of graphlets grows exponentially large as the graph size and/or graphlet size k grow. To deal with this difficulty, many sampling methods were proposed to estimate graphlet count with bounded error. Nevertheless, these methods require large number of samples to be statistically reliable, which is still computationally demanding. Moreover, they have to repeat laborious counting procedure even if a new graph is similar or exactly the same as previous studied graphs. Intuitively, learning from historic graphs can make estimation more accurate and avoid many repetitive counting to reduce computational cost. Based on this idea, we propose a convolutional neural network (CNN) framework and two preprocessing techniques to estimate graphlet count. Extensive experiments on two types of random graphs and real world biochemistry graphs show that our framework can offer substantial speedup on estimating graphlet count of new graphs with high accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/22/2018

REPT: A Streaming Algorithm of Approximating Global and Local Triangle Counts in Parallel

Recently, considerable efforts have been devoted to approximately comput...
research
01/06/2021

Counting independent sets in strongly orderable graphs

We consider the problem of devising algorithms to count exactly the numb...
research
04/20/2022

Scalable Motif Counting for Large-scale Temporal Graphs

One fundamental problem in temporal graph analysis is to count the occur...
research
06/22/2020

How to Count Triangles, without Seeing the Whole Graph

Triangle counting is a fundamental problem in the analysis of large grap...
research
02/12/2018

DiSLR: Distributed Sampling with Limited Redundancy For Triangle Counting in Graph Streams

Given a web-scale graph that grows over time, how should its edges be st...
research
05/31/2023

Improving Expressivity of Graph Neural Networks using Localization

In this paper, we propose localized versions of Weisfeiler-Leman (WL) al...
research
10/29/2021

On the Power of Edge Independent Graph Models

Why do many modern neural-network-based graph generative models fail to ...

Please sign up or login with your details

Forgot password? Click here to reset