Coverage of Author Identifiers in Web of Science and Scopus

by   Thomas Krämer, et al.

As digital collections of scientific literature are widespread and used frequently in knowledge-intense working environments, it has become a challenge to identify author names correctly. The treatment of homonyms is crucial for the reliable resolution of author names. Apart from varying handling of first, middle and last names, vendors as well as the digital library community created tools to address the problem of author name disambiguation. This technical report focuses on two widespread collections of scientific literature, Web of Science (WoS) and Scopus, and the coverage with author identification information such as Researcher ID, ORCID and Scopus Author Identifier in the period 1996 - 2014. The goal of this study is to describe the significant differences of the two collections with respect to overall distribution of author identifiers and its use across different subject domains. We found that the STM disciplines show the best coverage of author identifiers in our dataset of 6,032,000 publications which are both covered by WoS and Scopus. In our dataset we found 184,823 distinct ResearcherIDs and 70,043 distinct ORCIDs. In the appendix of this report we list a complete overview of all WoS subject areas and the amount of author identifiers in these subject areas.


page 1

page 2

page 3

page 4


Evaluating author name disambiguation for digital libraries: A case of DBLP

Author name ambiguity in a digital library may affect the findings of re...

Author name disambiguation of bibliometric data: A comparison of several unsupervised approaches

Adequately disambiguating author names in bibliometric databases is a pr...

A Bayesian Learning, Greedy agglomerative clustering approach and evaluation techniques for Author Name Disambiguation Problem

Author names often suffer from ambiguity owing to the same author appear...

Effective Unsupervised Author Disambiguation with Relative Frequencies

This work addresses the problem of author name homonymy in the Web of Sc...

Author Name Disambiguation by Using Deep Neural Network

Author name ambiguity decreases the quality and reliability of informati...

Ethnicity sensitive author disambiguation using semi-supervised learning

Author name disambiguation in bibliographic databases is the problem of ...

Mathematics of Human Motion: from Animation towards Simulation (A View form the Outside)

Simulation of human motion is the subject of study in a number of discip...

Please sign up or login with your details

Forgot password? Click here to reset