A multiple k-means cluster ensemble framework for clustering citation trajectories

by Joyita Chakraborty, et al.

Citation maturity time varies across articles, yet the impact of every article is measured in a fixed window. Clustering citation trajectories helps explain the knowledge diffusion process and reveals that not all articles gain immediate success after publication. Clustering trajectories is also necessary for paper impact recommendation algorithms. The problem is challenging because citation time series exhibit significant variability due to non-linear and non-stationary characteristics. Prior works propose a set of arbitrary thresholds and a fixed rule-based approach, and all of these methods are primarily parameter-dependent. This leads to inconsistencies in defining similar trajectories and to ambiguities regarding their specific number. Most studies capture only extreme trajectories. Thus, a generalised clustering framework is required. This paper proposes a feature-based multiple k-means cluster ensemble framework. A total of 195,783 and 41,732 well-cited articles from the Microsoft Academic Graph data are considered for clustering short-term (10-year) and long-term (30-year) trajectories, respectively. The framework has linear run time. Four distinct trajectories are obtained, including Early Rise Rapid Decline (2.2%) [...] Decline (53%). [...] differences for two different spans are studied. Most papers exhibit Early Rise Slow Decline and Delayed Rise No Decline patterns. The growth and decay times, cumulative citation distribution, and peak characteristics of individual trajectories are redefined empirically. A detailed comparative study reveals that the proposed methodology can detect all distinct trajectory classes.
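To make the idea of a feature-based multiple k-means cluster ensemble concrete, the following is a minimal sketch and not the authors' implementation. It assumes two hypothetical features per trajectory (relative peak position and the share of citations received after the peak), runs plain k-means several times with different seeds and values of k, and combines the runs through a co-association matrix, a common ensemble technique chosen here for illustration. The function names, the feature choice, the toy citation counts, and the 0.5 consensus threshold are all assumptions.

```python
import random


def features(series):
    """Hypothetical features: relative peak position, citation share after the peak."""
    total = sum(series) or 1
    peak = max(range(len(series)), key=series.__getitem__)
    return (peak / (len(series) - 1), sum(series[peak + 1:]) / total)


def sq_dist(p, q):
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, rng, iters=50):
    """Plain Lloyd's k-means; returns one cluster label per point."""
    centers = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: sq_dist(p, centers[c])) for p in points]
        new_centers = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centers.append(tuple(sum(x) / len(members) for x in zip(*members)))
            else:
                new_centers.append(centers[c])  # keep the old center of an empty cluster
        if new_centers == centers:
            break
        centers = new_centers
    return labels


def co_association(points, ks, runs, seed=0):
    """Fraction of k-means runs in which each pair of points shares a cluster."""
    rng = random.Random(seed)
    n = len(points)
    counts = [[0] * n for _ in range(n)]
    total = 0
    for k in ks:
        for _ in range(runs):
            labels = kmeans(points, k, rng)
            total += 1
            for i in range(n):
                for j in range(n):
                    if labels[i] == labels[j]:
                        counts[i][j] += 1
    return [[c / total for c in row] for row in counts]


def consensus(coassoc, threshold=0.5):
    """Consensus clusters: connected components of pairs co-clustered >= threshold."""
    n = len(coassoc)
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and coassoc[i][j] >= threshold:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels


# Toy yearly citation counts (made up for illustration, not from the paper).
series = [
    [10, 8, 2, 1, 0, 0],   # early rise, rapid decline
    [12, 9, 3, 1, 0, 0],   # early rise, rapid decline
    [0, 1, 2, 4, 7, 10],   # delayed rise, no decline
    [0, 0, 2, 5, 8, 12],   # delayed rise, no decline
]
points = [features(s) for s in series]
labels = consensus(co_association(points, ks=(2, 3), runs=10))
print(labels)
```

Combining several runs rather than trusting a single one reduces the dependence on any one choice of k or initialisation, which is the motivation for an ensemble. Note that the pairwise co-association step in this sketch is quadratic in the number of articles, so it does not reproduce the linear run time reported in the abstract.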



Search for Evergreens in Science: A Functional Data Analysis

Evergreens in science are papers that display a continual rise in annual...

Anomalous diffusion in the citation time series of scientific publications

We analyze the citation time-series of manuscripts in three different fi...

All downhill from the PhD? The typical impact trajectory of US academic careers

Within academia, mature researchers tend to be more senior, but do they ...

Same data may bring conflict results: a caution to use the disruptive index

In the last two decades, scholars have designed various types of bibliog...

The funding effect on citation and social attention: the UN Sustainable Development Goals (SDGs) as a case study

Purpose: Academic citation and social attention measure different dimens...
