Evaluating Research Dataset Recommendations in a Living Lab

09/29/2022
by   Jüri Keller, et al.
0

The search for research datasets is as important as laborious. Due to the importance of the choice of research data in further research, this decision must be made carefully. Additionally, because of the growing amounts of data in almost all areas, research data is already a central artifact in empirical sciences. Consequentially, research dataset recommendations can beneficially supplement scientific publication searches. We formulated the recommendation task as a retrieval problem by focussing on broad similarities between research datasets and scientific publications. In a multistage approach, initial recommendations were retrieved by the BM25 ranking function and dynamic queries. Subsequently, the initial ranking was re-ranked utilizing click feedback and document embeddings. The proposed system was evaluated live on real user interaction data using the STELLA infrastructure in the LiLAS Lab at CLEF 2021. Our experimental system could efficiently be fine-tuned before the live evaluation by pre-testing the system with a pseudo test collection based on prior user interaction data from the live system. The results indicate that the experimental system outperforms the other participating systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/24/2020

ArXivDigest: A Living Lab for Personalized Scientific Literature Recommendation

Providing personalized recommendations that are also accompanied by expl...
research
03/10/2022

Overview of LiLAS 2021 – Living Labs for Academic Search

The Living Labs for Academic Search (LiLAS) lab aims to strengthen the c...
research
07/19/2018

Mr. DLib's Living Lab for Scholarly Recommendations

We introduce the first living lab for scholarly recommender systems. Thi...
research
06/07/2023

Good Data, Large Data, or No Data? Comparing Three Approaches in Developing Research Aspect Classifiers for Biomedical Papers

The rapid growth of scientific publications, particularly during the COV...
research
06/04/2020

Characteristics of Dataset Retrieval Sessions: Experiences from a Real-life Digital Library

Secondary analysis or the reuse of existing survey data is a common prac...
research
10/19/2021

EILEEN: A recommendation system for scientific publications and grants

Finding relevant scientific articles is crucial for advancing knowledge....
research
10/05/2022

Performing live time-traversal queries via SPARQL on RDF datasets

This article introduces a methodology to perform live time-traversal SPA...

Please sign up or login with your details

Forgot password? Click here to reset