CheMixNet: Mixed DNN Architectures for Predicting Chemical Properties using Multiple Molecular Representations

11/14/2018
by   Arindam Paul, et al.
0

SMILES is a linear representation of chemical structures which encodes the connection table, and the stereochemistry of a molecule as a line of text with a grammar structure denoting atoms, bonds, rings and chains, and this information can be used to predict chemical properties. Molecular fingerprints are representations of chemical structures, successfully used in similarity search, clustering, classification, drug discovery, and virtual screening and are a standard and computationally efficient abstract representation where structural features are represented as a bit string. Both SMILES and molecular fingerprints are different representations for describing the structure of a molecule. There exist several predictive models for learning chemical properties based on either SMILES or molecular fingerprints. Here, our goal is to build predictive models that can leverage both these molecular representations. In this work, we present CheMixNet -- a set of neural networks for predicting chemical properties from a mixture of features learned from the two molecular representations -- SMILES as sequences and molecular fingerprints as vector inputs. We demonstrate the efficacy of CheMixNet architectures by evaluating on six different datasets. The proposed CheMixNet models not only outperforms the candidate neural architectures such as contemporary fully connected networks that uses molecular fingerprints and 1-D CNN and RNN models trained SMILES sequences, but also other state-of-the-art architectures such as Chemception and Molecular Graph Convolutions.

READ FULL TEXT

page 4

page 8

research
02/19/2021

MolCLR: Molecular Contrastive Learning of Representations via Graph Neural Networks

Molecular machine learning bears promise for efficient molecule property...
research
03/28/2019

Step Change Improvement in ADMET Prediction with PotentialNet Deep Featurization

The Absorption, Distribution, Metabolism, Elimination, and Toxicity (ADM...
research
04/01/2020

DeepSIBA: Chemical Structure-based Inference of Biological Alterations

Predicting whether a chemical structure shares a desired biological effe...
research
04/16/2017

Deep Learning Based Regression and Multi-class Models for Acute Oral Toxicity Prediction with Automatic Chemical Feature Extraction

For quantitative structure-property relationship (QSPR) studies in chemo...
research
07/24/2013

Electronic Visualisation in Chemistry: From Alchemy to Art

Chemists now routinely use software as part of their work. For example, ...
research
08/19/2016

Space-Filling Curves as a Novel Crystal Structure Representation for Machine Learning Models

A fundamental problem in applying machine learning techniques for chemic...
research
09/27/2022

Machine learning-accelerated chemistry modeling of protoplanetary disks

Aims. With the large amount of molecular emission data from (sub)millime...

Please sign up or login with your details

Forgot password? Click here to reset