Using Elasticsearch for entity recognition in affiliation disambiguation

by   Anne L'Hôte, et al.

Automatic recognition of affiliations in the metadata of scholarly publications is a key point for monitoring and analyzing trends in scientific production, especially in an open science context. We propose an automatic alignment method on registries, based on Elasticsearch. The proposed method is modular and leaves the choice of the alignment criteria to the user, allowing him to keep control over the precision and recall of the method. An implementation is proposed for an automatic alignment on three registries: countries, and RNSR (research laboratory directory in France) on the Github and the performances are analyzed in this paper.


page 1

page 2

page 3

page 4


Monitoring Open Access at a national level: French case study

After the launch of multiple plans for Open Science, there is now a need...

Trends in Cuban research output: publications and patents

Cuban science and technology are known for important achievements, parti...

Entity Recognition and Relation Extraction from Scientific and Technical Texts in Russian

This paper is devoted to the study of methods for information extraction...

High-Precision Extraction of Emerging Concepts from Scientific Literature

Identification of new concepts in scientific literature can help power f...

Content-based subject classification at article level in biomedical context

Subject classification is an important task to analyze scholarly publica...

Automatic Metadata Extraction Incorporating Visual Features from Scanned Electronic Theses and Dissertations

Electronic Theses and Dissertations (ETDs) contain domain knowledge that...

Cryo-RALib – a modular library for accelerating alignment in cryo-EM

With the enhancement of algorithms, cryo-EM has become the most efficien...