Measuring similarity in co-occurrence data using ego-networks

07/27/2020
by Xiaomeng Wang, et al.

Co-occurrence associations are widely observed in empirical data. Mining the information in co-occurrence data is essential for advancing our understanding of systems such as social networks, ecosystems, and brain networks. Measuring the similarity of entities is one of the important tasks, and it is usually achieved using a network-based approach. Here we show that traditional methods based on the aggregated network can introduce unwanted indirect relationships. To cope with this issue, we propose a similarity measure based on the ego network of each entity, which effectively considers the change of an entity's centrality from one ego network to another. The proposed index is easy to calculate and has a clear physical meaning. Using two different data sets, we compare the new index with existing ones. We find that the new index outperforms the traditional network-based similarity measures and can sometimes surpass the embedding method. Meanwhile, the similarities given by the new index are only weakly correlated with those given by other methods, so the index provides a different dimension for quantifying similarities in co-occurrence data. Altogether, our work extends network-based similarity measures and can potentially be applied in several related tasks.
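
The contrast between an aggregated network and an ego network can be illustrated with a minimal sketch. This is not the index proposed in the paper, which the abstract does not define. It assumes that an entity's ego network is built only from the records containing that entity, and it uses weighted degree as a stand-in centrality. The function names and the toy records are hypothetical.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_network(records):
    """Aggregate records (sets of entities) into weighted, undirected edges."""
    edges = Counter()
    for record in records:
        # Each unordered pair in a record adds 1 to that edge's weight.
        for u, v in combinations(sorted(set(record)), 2):
            edges[(u, v)] += 1
    return edges


def ego_network(records, ego):
    """Assumed construction: use only the records that contain `ego`."""
    return cooccurrence_network([r for r in records if ego in r])


def strength(edges, node):
    """Weighted degree, used here as a simple stand-in centrality."""
    return sum(w for (u, v), w in edges.items() if node in (u, v))


# Toy data: A and C never appear in the same record.
records = [{"A", "B"}, {"B", "C"}, {"A", "B", "D"}]

aggregated = cooccurrence_network(records)
ego_a = ego_network(records, "A")

# In the aggregated network, A reaches C through B (an indirect relationship).
print(sorted(aggregated.items()))
# In A's ego network, C is absent, so no such indirect path exists.
print(sorted(ego_a.items()))
# The centrality of B differs between the two networks.
print(strength(aggregated, "B"), strength(ego_a, "B"))
```

In this toy case the aggregated network links A to C through B even though the two never co-occur, while A's ego network omits C entirely. Comparing how a node's centrality shifts between networks is the kind of signal the abstract describes, though the exact formula is left to the full text.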

