Transformer Query-Target Knowledge Discovery (TEND): Drug Discovery from CORD-19

11/28/2020
by   Leo K. Tam, et al.
0

Previous work established skip-gram word2vec models could be used to mine knowledge in the materials science literature for the discovery of thermoelectrics. Recent transformer architectures have shown great progress in language modeling and associated fine-tuned tasks, but they have yet to be adapted for drug discovery. We present a RoBERTa transformer-based method that extends the masked language token prediction using query-target conditioning to treat the specificity challenge. The transformer discovery method entails several benefits over the word2vec method including domain-specific (antiviral) analogy performance, negation handling, and flexible query analysis (specific) and is demonstrated on influenza drug discovery. To stimulate COVID-19 research, we release an influenza clinical trials and antiviral analogies dataset used in conjunction with the COVID-19 Open Research Dataset Challenge (CORD-19) literature dataset in the study. We examine k-shot fine-tuning to improve the downstream analogies performance as well as to mine analogies for model explainability. Further, the query-target analysis is verified in a forward chaining analysis against the influenza drug clinical trials dataset, before adapted for COVID-19 drugs (combinations and side-effects) and on-going clinical trials. In consideration of the present topic, we release the model, dataset, and code.

READ FULL TEXT
research
01/22/2021

Drug and Disease Interpretation Learning with Biomedical Entity Representation Transformer

Concept normalization in free-form texts is a crucial step in every text...
research
03/06/2023

Enhancing Activity Prediction Models in Drug Discovery with the Ability to Understand Human Language

Activity and property prediction models are the central workhorses in dr...
research
04/19/2020

DeepPurpose: a Deep Learning Based Drug Repurposing Toolkit

We present DeepPurpose, a deep learning toolkit for simple and efficient...
research
07/20/2020

Few-shot link prediction via graph neural networks for Covid-19 drug-repurposing

Predicting interactions among heterogenous graph structured data has num...
research
09/04/2021

Characterizing interdisciplinarity in drug research: a translational science perspective

Despite the significant advances in life science, it still takes decades...
research
05/17/2021

Understanding the Performance of Knowledge Graph Embeddings in Drug Discovery

Knowledge Graphs (KG) and associated Knowledge Graph Embedding (KGE) mod...

Please sign up or login with your details

Forgot password? Click here to reset