S2AND: A Benchmark and Evaluation System for Author Name Disambiguation

03/12/2021
by   Shivashankar Subramanian, et al.
0

Author Name Disambiguation (AND) is the task of resolving which author mentions in a bibliographic database refer to the same real-world person, and is a critical ingredient of digital library applications such as search and citation analysis. While many AND algorithms have been proposed, comparing them is difficult because they often employ distinct features and are evaluated on different datasets. In response to this challenge, we present S2AND, a unified benchmark dataset for AND on scholarly papers, as well as an open-source reference model implementation. Our dataset harmonizes eight disparate AND datasets into a uniform format, with a single rich feature set drawn from the Semantic Scholar (S2) database. Our evaluation suite for S2AND reports performance split by facets like publication year and number of papers, allowing researchers to track both global performance and measures of fairness across facet values. Our experiments show that because previous datasets tend to cover idiosyncratic and biased slices of the literature, algorithms trained to perform well on one on them may generalize poorly to others. By contrast, we show how training on a union of datasets in S2AND results in more robust models that perform well even on datasets unseen in training. The resulting AND model also substantially improves over the production algorithm in S2, reducing error by over 50 trained models, and evaluation suite to the research community. https://github.com/allenai/S2AND/

READ FULL TEXT

page 1

page 8

research
01/22/2019

Discovering seminal works with marker papers

Bibliometric information retrieval in databases can employ different str...
research
09/07/2023

A-Eval: A Benchmark for Cross-Dataset Evaluation of Abdominal Multi-Organ Segmentation

Although deep learning have revolutionized abdominal multi-organ segment...
research
07/01/2021

Proof of Reference(PoR): A unified informetrics based consensus mechanism

Bibliometrics is useful to analyze the research impact for measuring the...
research
09/02/2019

The CL-SciSumm Shared Task 2018: Results and Key Insights

This overview describes the official results of the CL-SciSumm Shared Ta...
research
03/06/2020

Deep Learning Algorithms for Rotating Machinery Intelligent Diagnosis: An Open Source Benchmark Study

With the development of artificial intelligence and deep learning (DL) t...
research
06/16/2022

Pythae: Unifying Generative Autoencoders in Python – A Benchmarking Use Case

In recent years, deep generative models have attracted increasing intere...

Please sign up or login with your details

Forgot password? Click here to reset