Semi-Supervised Hierarchical Drug Embedding in Hyperbolic Space

06/01/2020
by Ke Yu, et al.

Learning accurate drug representations is essential for tasks such as computational drug repositioning and prediction of drug side effects. A drug hierarchy is a valuable resource that encodes human knowledge of drug relations in a tree-like structure, where drugs that act on the same organs, treat the same disease, or bind to the same biological target are grouped together. However, its utility in learning drug representations has not yet been explored, and existing drug representations cannot place novel molecules in a drug hierarchy. Here, we develop a semi-supervised drug embedding that incorporates two sources of information: (1) the underlying chemical grammar inferred from the molecular structures of drugs and drug-like molecules (unsupervised), and (2) the hierarchical relations encoded in an expert-crafted hierarchy of approved drugs (supervised). We use the Variational Auto-Encoder (VAE) framework to encode the chemical structures of molecules, and we use knowledge-based drug-drug similarity to induce the clustering of drugs in hyperbolic space, which is well suited to encoding hierarchical concepts. Both quantitative and qualitative results show that the learned drug embedding can accurately reproduce chemical structures and induce the hierarchical relations among drugs. Furthermore, our approach can infer the pharmacological properties of novel molecules by retrieving similar drugs from the embedding space. We demonstrate that the learned drug embedding can be used to find new uses for existing drugs and to discover side effects, and that it significantly outperforms baselines in both tasks.
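To make the retrieval step concrete, the following is a minimal sketch of nearest-neighbor lookup under a hyperbolic distance. It assumes the Poincaré ball model, which the abstract does not specify, and the names `poincare_distance`, `nearest_drugs`, and the toy labels `drug_a`, `drug_b`, `drug_c` are hypothetical illustrations rather than anything from the paper. The coordinates are invented for the example and are not learned embeddings.

```python
import math

def poincare_distance(u, v):
    """Geodesic distance between two points inside the unit Poincare ball.

    Assumption: the Poincare ball is used here only as one common model of
    hyperbolic space; the abstract does not say which model the authors use.
    """
    sq_norm_u = sum(x * x for x in u)
    sq_norm_v = sum(x * x for x in v)
    if sq_norm_u >= 1.0 or sq_norm_v >= 1.0:
        raise ValueError("points must lie strictly inside the unit ball")
    sq_diff = sum((a - b) ** 2 for a, b in zip(u, v))
    # d(u, v) = arcosh(1 + 2 * |u - v|^2 / ((1 - |u|^2) * (1 - |v|^2)))
    arg = 1.0 + 2.0 * sq_diff / ((1.0 - sq_norm_u) * (1.0 - sq_norm_v))
    return math.acosh(arg)

def nearest_drugs(query, embeddings, k=1):
    """Return the k (name, distance) pairs closest to `query`.

    `embeddings` is a hypothetical dict mapping a drug name to its coordinates.
    """
    scored = [(name, poincare_distance(query, point))
              for name, point in embeddings.items()]
    scored.sort(key=lambda pair: pair[1])
    return scored[:k]

# Toy 2-D coordinates, invented purely for illustration.
toy_embeddings = {
    "drug_a": (0.1, 0.0),
    "drug_b": (0.8, 0.0),
    "drug_c": (0.0, -0.5),
}

if __name__ == "__main__":
    for name, dist in nearest_drugs((0.2, 0.0), toy_embeddings, k=2):
        print(name, round(dist, 4))
```

In this model, distances grow rapidly near the boundary of the ball, which is one reason hyperbolic space can hold tree-like structures with little distortion: points near the origin can act as higher-level groups and points near the boundary as leaves.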


research
03/11/2021

Scaffold Embeddings: Learning the Structure Spanned by Chemical Fragments, Scaffolds and Compounds

Molecules have seemed like a natural fit to deep learning's tendency to ...
research
03/22/2022

Hierarchical Graph Representation Learning for the Prediction of Drug-Target Binding Affinity

The identification of drug-target binding affinity (DTA) has attracted i...
research
11/22/2019

Approaching Small Molecule Prioritization as a Cross-Modal Information Retrieval Task through Coordinated Representation Learning

Modeling the relationship between chemical structure and molecular activ...
research
03/30/2021

Unsupervised Hyperbolic Representation Learning via Message Passing Auto-Encoders

Most of the existing literature regarding hyperbolic embedding concentra...
research
09/18/2021

MM-Deacon: Multimodal molecular domain embedding analysis via contrastive learning

Molecular representation learning plays an essential role in cheminforma...
research
04/16/2020

Network-principled deep generative models for designing drug combinations as graph sets

Combination therapy has shown to improve therapeutic efficacy while redu...
research
01/12/2021

AI- and HPC-enabled Lead Generation for SARS-CoV-2: Models and Processes to Extract Druglike Molecules Contained in Natural Language Text

Researchers worldwide are seeking to repurpose existing drugs or discove...
