On Disambiguating Authors: Collaboration Network Reconstruction in a Bottom-up Manner

11/29/2020
by   Na Li, et al.
0

Author disambiguation arises when different authors share the same name, which is a critical task in digital libraries, such as DBLP, CiteULike, CiteSeerX, etc. While the state-of-the-art methods have developed various paper embedding-based methods performing in a top-down manner, they primarily focus on the ego-network of a target name and overlook the low-quality collaborative relations existed in the ego-network. Thus, these methods can be suboptimal for disambiguating authors. In this paper, we model the author disambiguation as a collaboration network reconstruction problem, and propose an incremental and unsupervised author disambiguation method, namely IUAD, which performs in a bottom-up manner. Initially, we build a stable collaboration network based on stable collaborative relations. To further improve the recall, we build a probabilistic generative model to reconstruct the complete collaboration network. In addition, for newly published papers, we can incrementally judge who publish them via only computing the posterior probabilities. We have conducted extensive experiments on a large-scale DBLP dataset to evaluate IUAD. The experimental results demonstrate that IUAD not only achieves the promising performance, but also outperforms comparable baselines significantly. Codes are available at https://github.com/papergitgit/IUAD.

READ FULL TEXT
research
07/11/2022

Whois? Deep Author Name Disambiguation using Bibliographic Data

As the number of authors is increasing exponentially over years, the num...
research
03/17/2023

Deep Author Name Disambiguation using DBLP Data

In the academic world, the number of scientists grows every year and so ...
research
06/19/2017

Feature analysis of multidisciplinary scientific collaboration patterns based on PNAS

The features of collaboration patterns are often considered to be differ...
research
09/11/2018

Is together better? Examining scientific collaborations across multiple authors, institutions, and departments

Collaborations are an integral part of scientific research and publishin...
research
02/19/2021

Publication Trend in an Indian Journal and a Pakistan Journal: A Comparative Analysis using Scientometric Approach

Scientometric analysis of 146 and 59 research articles published in Indi...
research
11/07/2018

Scale-free collaboration networks: An author name disambiguation perspective

Several studies have found that collaboration networks are scale-free, p...
research
09/19/2023

A dynamic mean-field statistical model of academic collaboration

There is empirical evidence that collaboration in academia has increased...

Please sign up or login with your details

Forgot password? Click here to reset