Large-Scale Knowledge Synthesis and Complex Information Retrieval from Biomedical Documents

02/14/2023
by   Shreya Saxena, et al.
0

Recent advances in the healthcare industry have led to an abundance of unstructured data, making it challenging to perform tasks such as efficient and accurate information retrieval at scale. Our work offers an all-in-one scalable solution for extracting and exploring complex information from large-scale research documents, which would otherwise be tedious. First, we briefly explain our knowledge synthesis process to extract helpful information from unstructured text data of research documents. Then, on top of the knowledge extracted from the documents, we perform complex information retrieval using three major components- Paragraph Retrieval, Triplet Retrieval from Knowledge Graphs, and Complex Question Answering (QA). These components combine lexical and semantic-based methods to retrieve paragraphs and triplets and perform faceted refinement for filtering these search results. The complexity of biomedical queries and documents necessitates using a QA system capable of handling queries more complex than factoid queries, which we evaluate qualitatively on the COVID-19 Open Research Dataset (CORD-19) to demonstrate the effectiveness and value-add.

READ FULL TEXT

page 1

page 6

research
02/10/2021

Biomedical Question Answering: A Comprehensive Review

Question Answering (QA) is a benchmark Natural Language Processing (NLP)...
research
11/08/2022

COV19IR : COVID-19 Domain Literature Information Retrieval

Increasing number of COVID-19 research literatures cause new challenges ...
research
08/09/2023

Building Interpretable and Reliable Open Information Retriever for New Domains Overnight

Information retrieval (IR) or knowledge retrieval, is a critical compone...
research
12/04/2019

WIKIR: A Python toolkit for building a large-scale Wikipedia-based English Information Retrieval Dataset

Over the past years, deep learning methods allowed for new state-of-the-...
research
08/06/2023

Embedding-based Retrieval with LLM for Effective Agriculture Information Extracting from Unstructured Data

Pest identification is a crucial aspect of pest control in agriculture. ...
research
10/17/2019

Indoor Information Retrieval using Lifelog Data

Studying human behaviour through lifelogging has seen an increase in att...
research
07/24/2020

COVID-19 Knowledge Graph: Accelerating Information Retrieval and Discovery for Scientific Literature

The coronavirus disease (COVID-19) has claimed the lives of over 350,000...

Please sign up or login with your details

Forgot password? Click here to reset