ASGN: An Active Semi-supervised Graph Neural Network for Molecular Property Prediction

07/07/2020
by   Zhongkai Hao, et al.

Molecular property prediction (e.g., of energy) is an essential problem in chemistry and biology. Unfortunately, many supervised learning methods suffer from the scarcity of labeled molecules in chemical space, because property labels are generally obtained by Density Functional Theory (DFT) calculations, which are extremely computationally costly. An effective solution is to incorporate unlabeled molecules in a semi-supervised fashion. However, learning semi-supervised representations for large numbers of molecules is challenging: the representation must jointly capture both molecular essence and structure, and representation learning can conflict with property learning. Here we propose a novel framework called Active Semi-supervised Graph Neural Network (ASGN) that incorporates both labeled and unlabeled molecules. Specifically, ASGN adopts a teacher-student framework. In the teacher model, we propose a novel semi-supervised learning method to learn a general representation that jointly exploits information from molecular structure and molecular distribution. Then, in the student model, we target the property prediction task to deal with the learning loss conflict. Finally, we propose a novel active learning strategy based on molecular diversity to select informative data throughout the training of the framework. We conduct extensive experiments on several public datasets. Experimental results show the remarkable performance of our ASGN framework.
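
The abstract does not spell out how diversity-based selection works, so the following is only a minimal sketch of one common way to pick diverse samples: greedy k-center (farthest-point) selection in an embedding space. The function name `k_center_greedy`, the toy 2-D embeddings, and the choice of Euclidean distance are all assumptions made for illustration, not details taken from the paper.

```python
import math


def k_center_greedy(embeddings, labeled_idx, budget):
    """Pick `budget` points, each time the one farthest from all chosen points.

    Hypothetical helper: illustrates diversity-based selection in general,
    not the exact strategy used by ASGN.
    """
    # Distance from each point to its nearest already-labeled point.
    min_dist = [
        min(math.dist(e, embeddings[j]) for j in labeled_idx)
        for e in embeddings
    ]
    selected = []
    for _ in range(budget):
        # The most "novel" candidate is the one farthest from the labeled set.
        pick = max(range(len(embeddings)), key=lambda i: min_dist[i])
        selected.append(pick)
        # Update nearest-center distances now that `pick` counts as labeled.
        for i, e in enumerate(embeddings):
            min_dist[i] = min(min_dist[i], math.dist(e, embeddings[pick]))
    return selected


# Toy 2-D points standing in for learned molecule representations (assumed).
embeddings = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0), (10.0, 0.0)]
chosen = k_center_greedy(embeddings, labeled_idx=[0], budget=2)
print(chosen)  # [4, 2]
```

In this toy run the two selected points lie in regions far from the single labeled point and from each other, which is the behavior a diversity criterion is meant to encourage: labeling effort (here, costly DFT calculations) is spent on molecules unlike those already labeled.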


research
02/16/2021

Few-Shot Graph Learning for Molecular Property Prediction

The recent success of graph neural networks has significantly boosted mo...
research
06/17/2021

Do Large Scale Molecular Language Representations Capture Important Structural Information?

Predicting chemical properties from the structure of a molecule is of gr...
research
02/03/2019

Bayesian semi-supervised learning for uncertainty-calibrated prediction of molecular properties and active learning

Predicting bioactivity and physical properties of small molecules is a c...
research
05/23/2022

Tyger: Task-Type-Generic Active Learning for Molecular Property Prediction

How to accurately predict the properties of molecules is an essential pr...
research
02/04/2023

Harnessing Simulation for Molecular Embeddings

While deep learning has unlocked advances in computational biology once ...
research
07/14/2020

Semi-supervised Learning with a Teacher-student Network for Generalized Attribute Prediction

This paper presents a study on semi-supervised learning to solve the vis...
research
11/28/2017

Semi-supervised learning of hierarchical representations of molecules using neural message passing

With the rapid increase of compound databases available in medicinal and...
