G2A2: An Automated Graph Generator with Attributes and Anomalies

10/14/2022
by   Saikat Dey, et al.
0

Many data-mining applications use dynamic attributed graphs to represent relational information; but due to security and privacy concerns, there is a dearth of available datasets that can be represented as dynamic attributed graphs. Even when such datasets are available, they do not have ground truth that can be used to train deep-learning models. Thus, we present G2A2, an automated graph generator with attributes and anomalies, which encompasses (1) probabilistic models to generate a dynamic bipartite graph, representing time-evolving connections between two independent sets of entities, (2) realistic injection of anomalies using a novel algorithm that captures the general properties of graph anomalies across domains, and (3) a deep generative model to produce realistic attributes, learned from an existing real-world dataset. Using the maximum mean discrepancy (MMD) metric to evaluate the realism of a G2A2-generated graph against three real-world graphs, G2A2 outperforms Kronecker graph generation by reducing the MMD distance by up to six-fold (6x).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/09/2022

Optimal Graph Filters for Clustering Attributed Graphs

Many real-world systems can be represented as graphs where the different...
research
07/23/2019

Node Attribute Generation on Graphs

Graph structured data provide two-fold information: graph structures and...
research
01/14/2020

Change Detection in Dynamic Attributed Networks

A network provides powerful means of representing complex relationships ...
research
08/09/2023

Data-driven Intra-Autonomous Systems Graph Generator

This paper introduces a novel deep-learning based generator of synthetic...
research
02/28/2023

Effective Community Search on Large Attributed Bipartite Graphs

Community search over bipartite graphs has attracted significant interes...
research
05/16/2018

Investigating the Agility Bias in DNS Graph Mining

The concept of agile domain name system (DNS) refers to dynamic and rapi...
research
09/30/2014

Data Imputation through the Identification of Local Anomalies

We introduce a comprehensive and statistical framework in a model free s...

Please sign up or login with your details

Forgot password? Click here to reset