AGATHA: Automatic Graph-mining And Transformer based Hypothesis generation Approach

02/13/2020
by   Justin Sybrandt, et al.
9

Medical research is risky and expensive. Drug discovery, as an example, requires that researchers efficiently winnow thousands of potential targets to a small candidate set for more thorough evaluation. However, research groups spend significant time and money to perform the experiments necessary to determine this candidate set long before seeing intermediate results. Hypothesis generation systems address this challenge by mining the wealth of publicly available scientific information to predict plausible research directions. We present AGATHA, a deep-learning hypothesis generation system that can introduce data-driven insights earlier in the discovery process. Through a learned ranking criteria, this system quickly prioritizes plausible term-pairs among entity sets, allowing us to recommend new research directions. We massively validate our system with a temporal holdout wherein we predict connections first introduced after 2015 using data published beforehand. We additionally explore biomedical sub-domains, and demonstrate AGATHA's predictive capacity across the twenty most popular relationship types. This system achieves best-in-class performance on an established benchmark, and demonstrates high recommendation scores across subdomains. Reproducibility: All code, experimental data, and pre-trained models are available online: sybrandt.com/2020/agatha

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/11/2018

Validation and Topic-driven Ranking for Biomedical Hypothesis Generation Systems

Literature underpins research, providing the foundation for new ideas. B...
research
06/05/2023

Literature-based Discovery for Landscape Planning

This project demonstrates how medical corpus hypothesis generation, a kn...
research
02/10/2021

Accelerating COVID-19 research with graph mining and transformer-based learning

In 2020, the White House released the, "Call to Action to the Tech Commu...
research
02/20/2017

MOLIERE: Automatic Biomedical Hypothesis Generation System

Hypothesis generation is becoming a crucial time-saving technique which ...
research
07/08/2022

GT4SD: Generative Toolkit for Scientific Discovery

With the growing availability of data within various scientific domains,...
research
10/22/2021

GeneDisco: A Benchmark for Experimental Design in Drug Discovery

In vitro cellular experimentation with genetic interventions, using for ...
research
10/05/2020

Temporal Positive-unlabeled Learning for Biomedical Hypothesis Generation via Risk Estimation

Understanding the relationships between biomedical terms like viruses, d...

Please sign up or login with your details

Forgot password? Click here to reset