Same Stats, Different Graphs (Graph Statistics and Why We Need Graph Drawings)

08/29/2018
by Hang Chen, et al.

Data analysts commonly use statistics to summarize large datasets. While exploring only the summary statistics of a dataset (e.g., min/mean/max) is often sufficient, Anscombe's Quartet demonstrates how such statistics can be misleading. Graph mining has a similar problem: graph statistics (e.g., density, connectivity, clustering coefficient) may not capture all of the critical properties of a given graph. To study the relationships between different graph properties and statistics, we examine all low-order (<= 10 nodes) non-isomorphic graphs and provide a simple visual analytics system to explore correlations across multiple graph properties. For graphs with more than ten nodes, however, generating the entire space of graphs quickly becomes intractable. We therefore use different random graph generation methods to examine the distribution of graph statistics for higher-order graphs and to investigate the impact of various sampling methodologies. We also describe a method for generating many graphs that are identical over a number of graph properties and statistics yet are clearly distinct.
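As a small illustration of the "same stats, different graphs" idea, the sketch below compares a 6-node cycle with two disjoint triangles. It is not the authors' system or generation method. The helper names (`make_graph`, `density`, `degree_sequence`, `average_clustering`, `component_count`) are hypothetical and written only for this example, using the Python standard library.

```python
from itertools import combinations


def make_graph(n, edges):
    """Build an undirected simple graph as a dict of adjacency sets."""
    adj = {v: set() for v in range(n)}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return adj


def density(adj):
    """Fraction of possible edges present: 2m / (n(n-1))."""
    n = len(adj)
    m = sum(len(nbrs) for nbrs in adj.values()) // 2
    return 2 * m / (n * (n - 1))


def degree_sequence(adj):
    """Sorted list of node degrees."""
    return sorted(len(nbrs) for nbrs in adj.values())


def average_clustering(adj):
    """Mean local clustering coefficient; nodes of degree < 2 count as 0."""
    total = 0.0
    for nbrs in adj.values():
        k = len(nbrs)
        if k < 2:
            continue
        links = sum(1 for a, b in combinations(nbrs, 2) if b in adj[a])
        total += 2 * links / (k * (k - 1))
    return total / len(adj)


def component_count(adj):
    """Number of connected components, found by depth-first search."""
    seen = set()
    count = 0
    for start in adj:
        if start in seen:
            continue
        count += 1
        seen.add(start)
        stack = [start]
        while stack:
            v = stack.pop()
            for w in adj[v]:
                if w not in seen:
                    seen.add(w)
                    stack.append(w)
    return count


# Two illustrative graphs (an assumption of this sketch, not taken from the paper).
cycle6 = make_graph(6, [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5), (5, 0)])
two_triangles = make_graph(6, [(0, 1), (1, 2), (2, 0), (3, 4), (4, 5), (5, 3)])

for name, g in [("cycle6", cycle6), ("two_triangles", two_triangles)]:
    print(name, density(g), degree_sequence(g),
          average_clustering(g), component_count(g))
```

Both graphs have six nodes, six edges, and every node of degree 2, so node count, edge count, density, and degree sequence all agree. The cycle is connected and contains no triangles, whereas the two triangles form a disconnected graph in which every node's neighbors are adjacent. The statistics that agree do not reveal this difference, while a drawing of either graph makes it obvious.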
