Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction

09/04/2023
by   Minghao Guo, et al.
0

The prediction of molecular properties is a crucial task in the field of material and drug discovery. The potential benefits of using deep learning techniques are reflected in the wealth of recent literature. Still, these techniques are faced with a common challenge in practice: Labeled data are limited by the cost of manual extraction from literature and laborious experimentation. In this work, we propose a data-efficient property predictor by utilizing a learnable hierarchical molecular grammar that can generate molecules from grammar production rules. Such a grammar induces an explicit geometry of the space of molecular graphs, which provides an informative prior on molecular structural similarity. The property prediction is performed using graph neural diffusion over the grammar-induced geometry. On both small and large datasets, our evaluation shows that this approach outperforms a wide spectrum of baselines, including supervised and pre-trained graph neural networks. We include a detailed ablation study and further analysis of our solution, showing its effectiveness in cases with extremely limited data. Code is available at https://github.com/gmh14/Geo-DEG.

READ FULL TEXT
research
03/15/2022

Data-Efficient Graph Grammar Learning for Molecular Generation

The problem of molecular generation has received significant attention r...
research
08/30/2022

HiGNN: Hierarchical Informative Graph Neural Networks for Molecular Property Prediction Equipped with Feature-Wise Attention

Elucidating and accurately predicting the druggability and bioactivities...
research
06/28/2021

LiteGEM: Lite Geometry Enhanced Molecular Representation Learning for Quantum Property Prediction

In this report, we (SuperHelix team) present our solution to KDD Cup 202...
research
03/23/2022

Geometry-Aware Supertagging with Heterogeneous Dynamic Convolutions

The syntactic categories of categorial grammar formalisms are structured...
research
11/07/2022

Retention Time Prediction for Chromatographic Enantioseparation by Quantile Geometry-enhanced Graph Neural Network

A new research framework is proposed to incorporate machine learning tec...
research
06/05/2019

Probabilistic hypergraph grammars for efficient molecular optimization

We present an approach to make molecular optimization more efficient. We...
research
02/04/2023

Harnessing Simulation for Molecular Embeddings

While deep learning has unlocked advances in computational biology once ...

Please sign up or login with your details

Forgot password? Click here to reset