Return to basics: Clustering of scientific literature using structural information

04/10/2020
by Jinhyuk Yun, et al.

Scholars frequently employ relatedness measures to estimate the similarity between two items (e.g., documents, authors, and institutes). Such measures are commonly based on overlapping references (i.e., bibliographic coupling) or overlapping citations (i.e., co-citation), and can be combined with cluster analysis to find boundaries between research fields. Unfortunately, calculating a relatedness measure is challenging, especially for a large number of items, because the computational complexity is greater than linear. We propose an alternative method for identifying research fronts that uses direct citations and is inspired by relatedness measures. Our approach simply replicates each node as two distinct nodes: a citing node and a cited node. We then apply typical clustering methods to the modified network. Clusters of citing nodes should emulate those from the bibliographic coupling relatedness network, while clusters of cited nodes should act like those from the co-citation relatedness network. In validation tests, the proposed method showed high levels of similarity with conventional relatedness-based methods. We also found that the clustering results of the proposed method outperformed those of conventional relatedness-based measures in terms of similarity with natural language processing–based classification.
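The node-splitting idea can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy citation list, the function names, and the use of connected components as the clustering step are all assumptions made for illustration. In practice, a community detection algorithm would replace the placeholder clustering.

```python
from collections import defaultdict


def split_citation_network(citations):
    """Build an undirected graph in which every paper appears twice.

    Each paper p is replicated as ("citing", p) and ("cited", p).
    A direct citation (a cites b) links ("citing", a) to ("cited", b).
    """
    adjacency = defaultdict(set)
    for citing, cited in citations:
        u = ("citing", citing)
        v = ("cited", cited)
        adjacency[u].add(v)
        adjacency[v].add(u)
    return adjacency


def connected_components(adjacency):
    """Placeholder clustering (an assumption): one cluster per component."""
    seen = set()
    clusters = []
    for start in sorted(adjacency):
        if start in seen:
            continue
        seen.add(start)
        stack = [start]
        component = set()
        while stack:
            node = stack.pop()
            component.add(node)
            for neighbour in adjacency[node]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    stack.append(neighbour)
        clusters.append(component)
    return clusters


def project(clusters, role):
    """Keep only the papers that play the given role in each cluster."""
    return [sorted(p for r, p in cluster if r == role) for cluster in clusters]


# Hypothetical toy data: (citing paper, cited paper).
citations = [("A", "X"), ("B", "X"), ("B", "Y"), ("C", "Z"), ("D", "Z")]

adjacency = split_citation_network(citations)
clusters = connected_components(adjacency)

citing_clusters = project(clusters, "citing")  # analogous to bibliographic coupling
cited_clusters = project(clusters, "cited")    # analogous to co-citation

print(citing_clusters)  # [['A', 'B'], ['C', 'D']]
print(cited_clusters)   # [['X', 'Y'], ['Z']]
```

In this toy example, A and B fall into the same citing cluster because both cite X, which is the situation bibliographic coupling captures. X and Y fall into the same cited cluster because B cites both, which is the situation co-citation captures. No pairwise relatedness matrix is built at any point; only the direct citation list is traversed.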

