Discovering Representative Attribute-stars via Minimum Description Length

04/27/2022
by   Jiahong Liu, et al.
0

Graphs are a popular data type found in many domains. Numerous techniques have been proposed to find interesting patterns in graphs to help understand the data and support decision-making. However, there are generally two limitations that hinder their practical use: (1) they have multiple parameters that are hard to set but greatly influence results, (2) and they generally focus on identifying complex subgraphs while ignoring relationships between attributes of nodes.Graphs are a popular data type found in many domains. Numerous techniques have been proposed to find interesting patterns in graphs to help understand the data and support decision-making. However, there are generally two limitations that hinder their practical use: (1) they have multiple parameters that are hard to set but greatly influence results, (2) and they generally focus on identifying complex subgraphs while ignoring relationships between attributes of nodes. To address these problems, we propose a parameter-free algorithm named CSPM (Compressing Star Pattern Miner) which identifies star-shaped patterns that indicate strong correlations among attributes via the concept of conditional entropy and the minimum description length principle. Experiments performed on several benchmark datasets show that CSPM reveals insightful and interpretable patterns and is efficient in runtime. Moreover, quantitative evaluations on two real-world applications show that CSPM has broad applications as it successfully boosts the accuracy of graph attribute completion models by up to 30.68% and uncovers important patterns in telecommunication alarm data.

READ FULL TEXT
research
03/11/2020

Compacting Frequent Star Patterns in RDF Graphs

Knowledge graphs have become a popular formalism for representing entiti...
research
11/03/2020

Learning on Attribute-Missing Graphs

Graphs with complete node attributes have been widely explored recently....
research
10/18/2019

Towards Interpretable Graph Modeling with Vertex Replacement Grammars

An enormous amount of real-world data exists in the form of graphs. Ofte...
research
12/22/2015

Keeping it Short and Simple: Summarising Complex Event Sequences with Multivariate Patterns

We study how to obtain concise descriptions of discrete multivariate seq...
research
05/22/2020

Discovering Frequent Gradual Itemsets with Imprecise Data

The gradual patterns that model the complex co-variations of attributes ...
research
06/19/2023

Pattern Mining for Anomaly Detection in Graphs: Application to Fraud in Public Procurement

In the context of public procurement, several indicators called red flag...
research
09/12/2020

Discovering Interesting Subgraphs in Social Media Networks

Social media data are often modeled as heterogeneous graphs with multipl...

Please sign up or login with your details

Forgot password? Click here to reset