Keyphrase Generation for Scientific Document Retrieval

06/28/2021
by   Florian Boudin, et al.

Sequence-to-sequence models have led to significant progress in keyphrase generation, but it remains unknown whether they are reliable enough to be beneficial for document retrieval. This study provides empirical evidence that such models can significantly improve retrieval performance, and introduces a new extrinsic evaluation framework that allows for a better understanding of the limitations of keyphrase generation models. Using this framework, we point out and discuss the difficulties encountered when supplementing documents with keyphrases that are not present in the text, and when generalizing models across domains. Our code is available at https://github.com/boudinfl/ir-using-kg
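
To make the idea of supplementing documents with keyphrases concrete, here is a minimal sketch of document expansion before indexing. It is an illustration only, not the authors' code: the function names, the toy term-overlap scorer (a stand-in for a real ranking function), and the example keyphrases (a hypothetical model output) are all assumptions.

```python
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (deliberately simplistic)."""
    return text.lower().split()


def expand(document, keyphrases):
    """Append predicted keyphrases to the document text before indexing."""
    return document + " " + " ".join(keyphrases)


def overlap_score(query, document):
    """Toy scorer: sum the document frequencies of the query terms.

    This is a stand-in for a real retrieval model, used only to show
    how expansion changes term matching.
    """
    doc_terms = Counter(tokenize(document))
    return sum(doc_terms[term] for term in tokenize(query))


doc = "we train a sequence-to-sequence model to predict phrases for papers"
# Hypothetical keyphrases, assumed to come from a generation model.
keyphrases = ["keyphrase generation", "document retrieval"]
query = "keyphrase generation"

score_before = overlap_score(query, doc)
score_after = overlap_score(query, expand(doc, keyphrases))
```

In this toy setting the query terms do not occur in the original text, so the document only becomes matchable once the keyphrases are appended. This is the situation the abstract describes for keyphrases that are not present in the text.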

research
05/23/2023

DAPR: A Benchmark on Document-Aware Passage Retrieval

Recent neural retrieval mainly focuses on ranking short texts and is cha...
research
07/19/2023

IncDSI: Incrementally Updatable Document Retrieval

Differentiable Search Index is a recently proposed paradigm for document...
research
01/09/2023

Doc2Query–: When Less is More

Doc2Query – the process of expanding the content of a document before in...
research
03/23/2021

Redefining Absent Keyphrases and their Effect on Retrieval Effectiveness

Neural keyphrase generation models have recently attracted much interest...
research
11/09/2020

Adversarial Semantic Collisions

We study semantic collisions: texts that are semantically unrelated but ...
research
06/08/2023

RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit

Although Large Language Models (LLMs) have demonstrated extraordinary ca...
research
08/20/2020

PARADE: Passage Representation Aggregation for Document Reranking

We present PARADE, an end-to-end Transformer-based model that considers ...
