Deep Author Name Disambiguation using DBLP Data

03/17/2023
by Zeyd Boukhers, et al.

In the academic world, the number of scientists grows every year, and so does the number of authors sharing the same name. Consequently, it is challenging to assign newly published papers to their respective authors, and Author Name Ambiguity (ANA) is considered a critical open problem in digital libraries. This paper proposes an Author Name Disambiguation (AND) approach that links author names to their real-world entities by leveraging their co-authors and domain of research. To this end, we use data collected from the DBLP repository, which contains more than 5 million bibliographic records authored by around 2.6 million co-authors. Our approach first groups authors who share the same last name and the same first-name initial. Within each group, an author is identified by capturing their relationship with co-authors and their area of research, which is represented by the titles of the author's validated publications. For this purpose, we train a neural network model that learns from the representations of the co-authors and titles. We validated the effectiveness of our approach by conducting extensive experiments on a large dataset.
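
The grouping step described in the abstract (same last name, same first-name initial) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `blocking_key` and `group_by_block` are hypothetical, and the sketch assumes names are written in "First [Middle] Last" order and separated by whitespace, which will not hold for every record in a real bibliographic dataset.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def blocking_key(full_name: str) -> Tuple[str, str]:
    """Return (last name, first-name initial), both lower-cased.

    Assumption: the name is in "First [Middle] Last" order.
    """
    parts = full_name.strip().split()
    if not parts:
        raise ValueError("empty author name")
    last_name = parts[-1].lower()
    first_initial = parts[0][0].lower()
    return last_name, first_initial


def group_by_block(names: Iterable[str]) -> Dict[Tuple[str, str], List[str]]:
    """Group author name strings that share the same blocking key."""
    blocks: Dict[Tuple[str, str], List[str]] = defaultdict(list)
    for name in names:
        blocks[blocking_key(name)].append(name)
    return dict(blocks)


# Hypothetical example names, used only for illustration.
names = ["Wei Wang", "W. Wang", "Wendy Wang", "Li Wang", "Zeyd Boukhers"]
blocks = group_by_block(names)
for key, members in blocks.items():
    print(key, members)
```

Each resulting group is a set of candidate name variants that may or may not refer to the same person; deciding which real-world author a given paper belongs to within a group is the part the abstract assigns to the neural network model, and it is not covered by this sketch.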
