MOLIERE: Automatic Biomedical Hypothesis Generation System

02/20/2017
by   Justin Sybrandt, et al.
0

Hypothesis generation is becoming a crucial time-saving technique which allows biomedical researchers to quickly discover implicit connections between important concepts. Typically, these systems operate on domain-specific fractions of public medical data. MOLIERE, in contrast, utilizes information from over 24.5 million documents. At the heart of our approach lies a multi-modal and multi-relational network of biomedical objects extracted from several heterogeneous datasets from the National Center for Biotechnology Information (NCBI). These objects include but are not limited to scientific papers, keywords, genes, proteins, diseases, and diagnoses. We model hypotheses using Latent Dirichlet Allocation applied on abstracts found near shortest paths discovered within this network, and demonstrate the effectiveness of MOLIERE by performing hypothesis generation on historical data. Our network, implementation, and resulting data are all publicly available for the broad scientific community.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/10/2021

Accelerating COVID-19 research with graph mining and transformer-based learning

In 2020, the White House released the, "Call to Action to the Tech Commu...
research
02/13/2020

CBAG: Conditional Biomedical Abstract Generation

Biomedical research papers use significantly different language and jarg...
research
02/13/2020

AGATHA: Automatic Graph-mining And Transformer based Hypothesis generation Approach

Medical research is risky and expensive. Drug discovery, as an example, ...
research
10/05/2020

Temporal Positive-unlabeled Learning for Biomedical Hypothesis Generation via Risk Estimation

Understanding the relationships between biomedical terms like viruses, d...
research
02/11/2018

Validation and Topic-driven Ranking for Biomedical Hypothesis Generation Systems

Literature underpins research, providing the foundation for new ideas. B...
research
06/20/2023

Harnessing the Power of Adversarial Prompting and Large Language Models for Robust Hypothesis Generation in Astronomy

This study investigates the application of Large Language Models (LLMs),...
research
06/28/2020

A method to calculate Great Britains half-hourly electrical demand from publicly available data

Publicly available electrical generation and interconnector data is comb...

Please sign up or login with your details

Forgot password? Click here to reset