Fine-grained classification of journal articles by relying on multiple layers of information through similarity network fusion: the case of the Cambridge Journal of Economics

by   Alberto Baccini, et al.

This paper explores the possibility of classifying journal articles by exploiting multiple information sources, instead of relying on only one information source at a time. In particular, the Similarity Network Fusion (SNF) technique is used to merge the different layers of information about articles when they are organized as a multiplex network. The method proposed is tested on a case study consisting of the articles published in the Cambridge Journal of Economics. The information about articles is organized in a two-layer multiplex where the first layer contains similarities among articles based on the full-text of articles, and the second layer contains similarities based on the cited references. The unsupervised similarity network fusion process combines the two layers by building a new single-layer network. Distance correlation and partial distance correlation indexes are then used for estimating the contribution of each layer of information to the determination of the structure of the fused network. A clustering algorithm is lastly applied to the fused network for obtaining a classification of articles. The classification obtained through SNF has been evaluated from an expert point of view, by inspecting whether it can be interpreted and labelled with reference to research programs and methodologies adopted in economics. Moreover, the classification obtained in the fused network is compared with the two classifications obtained when cited references and contents are considered separately. Overall, the classification obtained on the fused network appears to be fine-grained enough to represent the extreme heterogeneity characterizing the contributions published in the Cambridge Journal of Economics.


page 16

page 17


Similarity network fusion for scholarly journals

This paper explores intellectual and social proximity among scholarly jo...

Differences between preprints and journal articles : Trial using bioRxiv data

In this paper, we attempted to obtain knowledge about how research is co...

Mapping the backbone of the Humanities through the eyes of Wikipedia

The present study aims to establish a valid method by which to apply the...

ScienceWISE: Topic Modeling over Scientific Literature Networks

We provide an up-to-date view on the knowledge management system Science...

A Multiple Linear Regression Analysis to Measure the Journal Contribution to the Social Attention of Research

This paper proposes a three-year average of social attention as a more r...

Tweeting about journal articles: Engagement, marketing or just gibberish?

This paper presents preliminary results on the analysis of tweets to jou...

Please sign up or login with your details

Forgot password? Click here to reset