Interactive Extractive Search over Biomedical Corpora

06/07/2020
by   Hillel Taub-Tabib, et al.
0

We present a system that allows life-science researchers to search a linguistically annotated corpus of scientific texts using patterns over dependency graphs, as well as using patterns over token sequences and a powerful variant of boolean keyword queries. In contrast to previous attempts to dependency-based search, we introduce a light-weight query language that does not require the user to know the details of the underlying linguistic representations, and instead to query the corpus by providing an example sentence coupled with simple markup. Search is performed at an interactive speed due to efficient linguistic graph-indexing and retrieval engine. This allows for rapid exploration, development and refinement of user queries. We demonstrate the system using example workflows over two corpora: the PubMed corpus including 14,446,243 PubMed abstracts and the CORD-19 dataset, a collection of over 45,000 research papers focused on COVID-19 research. The system is publicly available at https://allenai.github.io/spike

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/04/2020

Syntactic Search by Example

We present a system that allows a user to search a large linguistically ...
research
08/31/2021

A Search Engine for Discovery of Scientific Challenges and Directions

Keeping track of scientific challenges, advances and emerging directions...
research
10/24/2020

COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval

We present a large challenging dataset, COUGH, for COVID-19 FAQ retrieva...
research
06/08/2021

Neural Extractive Search

Domain experts often need to extract structured information from large c...
research
01/08/2015

The Hebrew Bible as Data: Laboratory - Sharing - Experiences

The systematic study of ancient texts including their production, transm...
research
09/10/2017

Improving average ranking precision in user searches for biomedical research datasets

Availability of research datasets is keystone for health and life scienc...
research
05/20/2020

SciSight: Combining faceted navigation and research group detection for COVID-19 exploratory scientific search

The COVID-19 pandemic has sparked unprecedented mobilization of scientis...

Please sign up or login with your details

Forgot password? Click here to reset