Icospherical Chemical Objects (ICOs) allow for chemical data augmentation and maintain rotational, translation and permutation invariance

04/15/2023
by   Ella Gale, et al.
0

Dataset augmentation is a common way to deal with small datasets; Chemistry datasets are often small. Spherical convolutional neural networks (SphNNs) and Icosahedral neural networks (IcoNNs) are a type of geometric machine learning algorithm that maintains rotational symmetry. Molecular structure has rotational invariance and is inherently 3-D, and thus we need 3-D encoding methods to input molecular structure into machine learning. In this paper I present Icospherical Chemical Objects (ICOs) that enable the encoding of 3-D data in a rotationally invariant way which works with spherical or icosahedral neural networks and allows for dataset augmentation. I demonstrate the ICO featurisation method on the following tasks: predicting general molecular properties, predicting solubility of drug like molecules and the protein binding problem and find that ICO and SphNNs perform well on all problems.

READ FULL TEXT

page 3

page 6

page 7

page 8

page 11

page 13

page 14

research
10/08/2021

Learning 3D Representations of Molecular Chirality with Invariance to Bond Rotations

Molecular chirality, a form of stereochemistry most often describing rel...
research
02/21/2023

Machine learning for the prediction of safe and biologically active organophosphorus molecules

Drug discovery is a complex process with a large molecular space to be c...
research
12/11/2018

Synergy Effect between Convolutional Neural Networks and the Multiplicity of SMILES for Improvement of Molecular Prediction

In our study, we demonstrate the synergy effect between convolutional ne...
research
07/22/2021

Size doesn't matter: predicting physico- or biochemical properties based on dozens of molecules

The use of machine learning in chemistry has become a common practice. A...
research
03/27/2023

HD-Bind: Encoding of Molecular Structure with Low Precision, Hyperdimensional Binary Representations

Publicly available collections of drug-like molecules have grown to comp...
research
10/22/2020

Learning Invariances in Neural Networks

Invariances to translations have imbued convolutional neural networks wi...
research
05/23/2022

MolMiner: You only look once for chemical structure recognition

Molecular structures are always depicted as 2D printed form in scientifi...

Please sign up or login with your details

Forgot password? Click here to reset