Co-citation and Co-authorship Networks of Statisticians

04/24/2022
by   Pengsheng Ji, et al.
0

We collected and cleaned a large data set on publications in statistics. The data set consists of the coauthor relationships and citation relationships of 83, 331 papers published in 36 representative journals in statistics, probability, and machine learning, spanning 41 years. The data set allows us to construct many different networks, and motivates a number of research problems about the research patterns and trends, research impacts, and network topology of the statistics community. In this paper we focus on (i) using the citation relationships to estimate the research interests of authors, and (ii) using the coauthor relationships to study the network topology. Using co-citation networks we constructed, we discover a "statistics triangle", reminiscent of the statistical philosophy triangle (Efron, 1998). We propose new approaches to constructing the "research map" of statisticians, as well as the "research trajectory" for a given author to visualize his/her research interest evolvement. Using co-authorship networks we constructed, we discover a multi-layer community tree and produce a Sankey diagram to visualize the author migrations in different sub-areas. We also propose several new metrics for research diversity of individual authors. We find that "Bayes", "Biostatistics", and "Nonparametric" are three primary areas in statistics. We also identify 15 sub-areas, each of which can be viewed as a weighted average of the primary areas, and identify several underlying reasons for the formation of co-authorship communities. We also find that the research interests of statisticians have evolved significantly in the 41-year time window we studied: some areas (e.g., biostatistics, high-dimensional data analysis, etc.) have become increasingly more popular.

READ FULL TEXT
research
08/22/2023

Large-scale Multi-layer Academic Networks Derived from Statistical Publications

The utilization of multi-layer network structures now enables the explan...
research
09/21/2016

Bibliographic Analysis on Research Publications using Authors, Categorical Labels and the Citation Network

Bibliographic analysis considers the author's research areas, the citati...
research
09/22/2016

Bibliographic Analysis with the Citation Network Topic Model

Bibliographic analysis considers author's research areas, the citation n...
research
06/05/2019

Large publishing consortia produce higher citation impact research but co-author contributions are hard to evaluate

This paper introduces a simple agglomerative clustering method to identi...
research
04/25/2022

Impact of Geographic Diversity on Citation of Collaborative Research

Diversity in human capital is widely seen as critical to creating holist...
research
02/10/2021

The effects of citation-based research evaluation schemes on self-citation behavior

We investigate the changes in the self-citation behavior of Italian prof...
research
08/15/2012

Statistical Common Author Networks (SCAN)

A new method for visualizing the relatedness of scientific areas is deve...

Please sign up or login with your details

Forgot password? Click here to reset