Memetic search for overlapping topics based on a local evaluation of link communities

02/09/2017
by   Frank Havemann, et al.
0

In spite of recent advances in field delineation methods, bibliometricians still don't know the extent to which their topic detection algorithms reconstruct `ground truths', i.e. thematic structures in the scientific literature. In this paper, we demonstrate a new approach to the delineation of thematic structures that attempts to match the algorithm to theoretically derived and empirically observed properties all thematic structures have in common. We cluster citation links rather than publication nodes, use predominantly local information and search for communities of links starting from seed subgraphs in order to allow for pervasive overlaps of topics. We evaluate sets of links with a new cost function and assume that local minima in the cost landscape correspond to link communities. Because this cost landscape has many local minima we define a valid community as the community with the lowest minimum within a certain range. Since finding all valid communities is impossible for large networks, we designed a memetic algorithm that combines probabilistic evolutionary strategies with deterministic local searches. We apply our approach to a network of about 15,000 Astronomy & Astrophysics papers published 2010 and their cited sources, and to a network of about 100,000 Astronomy & Astrophysics papers (published 2003--2010) which are linked through direct citations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/30/2020

Topics as Clusters of Citation Links to Highly Cited Sources: The Case of Research on International Relations

Following Henry Small in his approach to co-citation analysis, highly ci...
research
11/14/2021

Center-Periphery Structure in Communities: Extracellular Vesicles

Clustering and community detection in networks are of broad interest and...
research
10/24/2018

Communities as Well Separated Subgraphs With Cohesive Cores: Identification of Core-Periphery Structures in Link Communities

Communities in networks are commonly considered as highly cohesive subgr...
research
03/28/2013

Scalable Text and Link Analysis with Mixed-Topic Link Models

Many data sets contain rich information about objects, as well as pairwi...
research
04/01/2020

GitHub Repositories with Links to Academic Papers: Open Access, Traceability, and Evolution

Traceability between published scientific breakthroughs and their implem...
research
07/28/2020

Finding Scientific Communities In Citation Graphs: Convergent Clustering

Understanding the nature and organization of scientific communities is o...
research
07/25/2014

Two years of ALMA bibliography - lessons learned

Telescope bibliographies are integral parts of observing facilities. The...

Please sign up or login with your details

Forgot password? Click here to reset