Using Elasticsearch for entity recognition in affiliation disambiguation

10/05/2021
by   Anne L'Hôte, et al.
0

Automatic recognition of affiliations in the metadata of scholarly publications is a key point for monitoring and analyzing trends in scientific production, especially in an open science context. We propose an automatic alignment method on registries, based on Elasticsearch. The proposed method is modular and leaves the choice of the alignment criteria to the user, allowing him to keep control over the precision and recall of the method. An implementation is proposed for an automatic alignment on three registries: countries, GRID.ac and RNSR (research laboratory directory in France) on the Github https://github.com/dataesr/matcher and the performances are analyzed in this paper.

READ FULL TEXT

page 1

page 2

page 3

page 4

04/14/2021

Monitoring Open Access at a national level: French case study

After the launch of multiple plans for Open Science, there is now a need...
07/19/2020

Trends in Cuban research output: publications and patents

Cuban science and technology are known for important achievements, parti...
11/19/2020

Entity Recognition and Relation Extraction from Scientific and Technical Texts in Russian

This paper is devoted to the study of methods for information extraction...
06/11/2020

High-Precision Extraction of Emerging Concepts from Scientific Literature

Identification of new concepts in scientific literature can help power f...
04/30/2021

Content-based subject classification at article level in biomedical context

Subject classification is an important task to analyze scholarly publica...
07/01/2021

Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and Dissertations

Electronic Theses and Dissertations (ETDs) contain domain knowledge that...
11/11/2020

Cryo-RALib – a modular library for accelerating alignment in cryo-EM

With the enhancement of algorithms, cryo-EM has become the most efficien...