How Much Chemistry Does a Deep Neural Network Need to Know to Make Accurate Predictions?

10/05/2017
by Garrett B. Goh, et al.

In the last few years, we have seen the rise of deep learning applications in a broad range of chemistry research problems. Recently, we reported on the development of Chemception, a deep convolutional neural network (CNN) architecture for general-purpose small molecule property prediction. In this work, we investigate the effects of systematically removing and adding basic chemical information to the image channels of the 2D images used to train Chemception. By augmenting the images with only three additional pieces of basic chemical information, we demonstrate that Chemception now outperforms contemporary deep learning models trained on more sophisticated chemical representations (molecular fingerprints) for the prediction of toxicity, activity, and solvation free energy, and that it also outperforms physics-based free energy simulation methods. Thus, our work demonstrates that a firm grasp of first-principles chemical knowledge is not a prerequisite for deep learning models to accurately predict chemical properties. Lastly, by altering the chemical information content in the images and examining the resulting performance of Chemception, we identify two different learning patterns, one for predicting toxicity/activity and another for predicting solvation free energy. These patterns suggest that Chemception learns its tasks in a manner consistent with established knowledge.
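To make the idea of chemical information stored in image channels more concrete, the following is a minimal sketch, not the authors' implementation. The function name `encode_molecule`, the grid size, the resolution, and the two example channels (atomic number and formal charge) are all assumptions chosen for illustration; the abstract does not specify which properties Chemception uses.

```python
# Illustrative sketch only: grid size, resolution, and channel choices are
# assumptions, not the settings used to train Chemception.

def encode_molecule(atoms, size=8, resolution=1.0):
    """Rasterize atoms onto a size x size grid, one channel per property.

    `atoms` is a list of dicts with hypothetical keys: `x`, `y` (2D
    coordinates) and `channels` (a tuple of per-atom property values).
    """
    n_channels = len(atoms[0]["channels"])
    # Start from an all-zero image: empty pixels carry no chemical information.
    image = [[[0.0] * n_channels for _ in range(size)] for _ in range(size)]
    for atom in atoms:
        # Map coordinates to pixel indices, with the origin at the grid center.
        col = round(atom["x"] / resolution) + size // 2
        row = round(atom["y"] / resolution) + size // 2
        if 0 <= row < size and 0 <= col < size:
            image[row][col] = [float(v) for v in atom["channels"]]
    return image


# Hydroxide ion as a toy input; channels are (atomic number, formal charge).
atoms = [
    {"x": 0.0, "y": 0.0, "channels": (8, -1)},  # oxygen
    {"x": 1.0, "y": 0.0, "channels": (1, 0)},   # hydrogen
]
image = encode_molecule(atoms)
```

Removing or adding a property in this sketch amounts to dropping or appending an entry in each atom's `channels` tuple, which changes the number of image channels a CNN would see.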

research
06/20/2017

Chemception: A Deep Neural Network with Minimal Chemistry Knowledge Matches the Performance of Expert-developed QSAR/QSPR Models

In the last few years, we have seen the transformative impact of deep le...
research
12/06/2017

SMILES2Vec: An Interpretable General-Purpose Deep Neural Network for Predicting Chemical Properties

Chemical databases store information in text representations, and the SM...
research
02/21/2022

Ligandformer: A Graph Neural Network for Predicting Compound Property with Robust Interpretation

Robust and efficient interpretation of QSAR methods is quite useful to v...
research
12/07/2017

Using Rule-Based Labels for Weak Supervised Learning: A ChemNet for Transferable Chemical Property Prediction

With access to large datasets, deep neural networks (DNN) have achieved ...
research
11/15/2022

ParticleGrid: Enabling Deep Learning using 3D Representation of Materials

From AlexNet to Inception, autoencoders to diffusion models, the develop...
research
10/07/2021

Predicting Chemical Hazard across Taxa through Machine Learning

We apply machine learning methods to predict chemical hazards focusing o...
research
10/12/2022

When does deep learning fail and how to tackle it? A critical analysis on polymer sequence-property surrogate models

Deep learning models are gaining popularity and potency in predicting po...
