PDNS-Net: A Large Heterogeneous Graph Benchmark Dataset of Network Resolutions for Graph Learning

03/15/2022
by Udesh Kumarasinghe, et al.

To advance the state of the art in graph learning algorithms, it is necessary to construct large real-world datasets. While there are many benchmark datasets for homogeneous graphs, only a few are available for heterogeneous graphs. Furthermore, the existing heterogeneous graphs are small, which makes them insufficient for understanding how graph learning algorithms perform in terms of classification metrics and computational resource utilization. We introduce PDNS-Net, the largest public heterogeneous graph dataset, containing 447K nodes and 897K edges for the malicious domain classification task. Compared to the popular heterogeneous datasets IMDB and DBLP, PDNS-Net is 38 and 17 times larger, respectively. We provide a detailed analysis of PDNS-Net, including the data collection methodology, heterogeneous graph construction, descriptive statistics, and preliminary graph classification performance. The dataset is publicly available at https://github.com/qcri/PDNS-Net. Our preliminary evaluation of popular homogeneous and heterogeneous graph neural networks on PDNS-Net reveals that further research is required to improve the performance of these models on large heterogeneous graphs.

research
09/20/2023

Improving Article Classification with Edge-Heterogeneous Graph Neural Networks

Classifying research output into context-specific label taxonomies is a ...
research
05/31/2023

Spectral Heterogeneous Graph Convolutions via Positive Noncommutative Polynomials

Heterogeneous Graph Neural Networks (HGNNs) have gained significant popu...
research
07/16/2020

TUDataset: A collection of benchmark datasets for learning with graphs

Recently, there has been an increasing interest in (supervised) learning...
research
11/16/2020

A Large-Scale Database for Graph Representation Learning

With the rapid emergence of graph representation learning, the construct...
research
03/05/2023

Heterogeneous Graph Learning for Acoustic Event Classification

Heterogeneous graphs provide a compact, efficient, and scalable way to m...
research
05/04/2023

PGB: A PubMed Graph Benchmark for Heterogeneous Network Representation Learning

There has been a rapid growth in biomedical literature, yet capturing th...
research
04/14/2022

EXPERT: Public Benchmarks for Dynamic Heterogeneous Academic Graphs

Machine learning models that learn from dynamic graphs face nontrivial c...
