The DELICES project: Indexing scientific literature through semantic expansion

by   Florian Boudin, et al.

Scientific digital libraries play a critical role in the development and dissemination of scientific literature. Despite dedicated search engines, retrieving relevant publications from the ever-growing body of scientific literature remains challenging and time-consuming. Indexing scientific articles is indeed a difficult matter, and current models solely rely on a small portion of the articles (title and abstract) and on author-assigned keyphrases when available. This results in a frustratingly limited access to scientific knowledge. The goal of the DELICES project is to address this pitfall by exploiting semantic relations between scientific articles to both improve and enrich indexing. To this end, we will rely on the latest advances in semantic representations to both increase the relevance of keyphrases extracted from the documents, and extend indexing to new terms borrowed from semantically similar documents.


page 1

page 2


Cascading Citation Expansion

Digital Science's Dimensions is envisaged as a next-generation research ...

Towards a Semantic Search Engine for Scientific Articles

Because of the data deluge in scientific publication, finding relevant i...

Doc2Vec on the PubMed corpus: study of a new approach to generate related articles

PubMed is the biggest and most used bibliographic database worldwide, ho...

A Joint Learning Approach based on Self-Distillation for Keyphrase Extraction from Scientific Documents

Keyphrase extraction is the task of extracting a small set of phrases th...

Semantic and Relational Spaces in Science of Science: Deep Learning Models for Article Vectorisation

Over the last century, we observe a steady and exponentially growth of s...

How scientific literature has been evolving over the time? A novel statistical approach using tracking verbal-based methods

This paper provides a global vision of the scientific publications relat...

COVID-19 Knowledge Graph: Accelerating Information Retrieval and Discovery for Scientific Literature

The coronavirus disease (COVID-19) has claimed the lives of over 350,000...