GraphGen: A Scalable Approach to Domain-agnostic Labeled Graph Generation

01/22/2020
by   Nikhil Goyal, et al.
27

Graph generative models have been extensively studied in the data mining literature. While traditional techniques are based on generating structures that adhere to a pre-decided distribution, recent techniques have shifted towards learning this distribution directly from the data. While learning-based approaches have imparted significant improvement in quality, some limitations remain to be addressed. First, learning graph distributions introduces additional computational overhead, which limits their scalability to large graph databases. Second, many techniques only learn the structure and do not address the need to also learn node and edge labels, which encode important semantic information and influence the structure itself. Third, existing techniques often incorporate domain-specific rules and lack generalizability. Fourth, the experimentation of existing techniques is not comprehensive enough due to either using weak evaluation metrics or focusing primarily on synthetic or small datasets. In this work, we develop a domain-agnostic technique called GraphGen to overcome all of these limitations. GraphGen converts graphs to sequences using minimum DFS codes. Minimum DFS codes are canonical labels and capture the graph structure precisely along with the label information. The complex joint distributions between structure and semantic labels are learned through a novel LSTM architecture. Extensive experiments on million-sized, real graph datasets show GraphGen to be 4 times faster on average than state-of-the-art techniques while being significantly better in quality across a comprehensive set of 11 different metrics. Our code is released at https://github.com/idea-iitd/graphgen.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/18/2021

GraphGen-Redux: a Fast and Lightweight Recurrent Model for labeled Graph Generation

The problem of labeled graph generation is gaining attention in the Deep...
research
01/24/2022

On Evaluation Metrics for Graph Generative Models

In image generation, generative models can be evaluated naturally by vis...
research
06/05/2019

GRAM: Scalable Generative Models for Graphs with Graph Attention Mechanism

Graphs are ubiquitous real-world data structures, and generative models ...
research
01/25/2023

Improving Graph Generation by Restricting Graph Bandwidth

Deep graph generative modeling has proven capable of learning the distri...
research
08/21/2022

Heterogeneous Graph Masked Autoencoders

Generative self-supervised learning (SSL), especially masked autoencoder...
research
01/24/2022

Learning Graph Augmentations to Learn Graph Representations

Devising augmentations for graph contrastive learning is challenging due...
research
07/13/2023

GRAN is superior to GraphRNN: node orderings, kernel- and graph embeddings-based metrics for graph generators

A wide variety of generative models for graphs have been proposed. They ...

Please sign up or login with your details

Forgot password? Click here to reset