Structure to Property: Chemical Element Embeddings and a Deep Learning Approach for Accurate Prediction of Chemical Properties

09/17/2023
by   Shokirbek Shermukhamedov, et al.

The application of machine learning (ML) techniques in computational chemistry has led to significant advances in predicting molecular properties, accelerating drug discovery, and material design. ML models can extract hidden patterns and relationships from complex and large datasets, allowing for the prediction of various chemical properties with high accuracy. The use of such methods has enabled the discovery of molecules and materials that were previously difficult to identify. This paper introduces a new ML model based on deep learning techniques, such as a multilayer encoder and decoder architecture, for classification tasks. We demonstrate the opportunities offered by our approach by applying it to various types of input data, including organic and inorganic compounds. In particular, we developed and tested the model using the Matbench and MoleculeNet benchmarks, which include crystal properties and drug design-related benchmarks. We also conduct a comprehensive analysis of vector representations of chemical compounds, shedding light on the underlying patterns in molecular data. The models used in this work exhibit a high degree of predictive power, underscoring the progress that can be made with refined machine learning when applied to molecular and material datasets. For instance, on the Tox21 dataset, we achieved an average accuracy of 96%. The code is publicly available at https://github.com/dmamur/elembert.
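
To make the idea of vector representations of chemical compounds concrete, the following is a minimal sketch of how a compound could be turned into a sequence of element tokens and mapped to embedding vectors before being passed to an encoder. It is an illustration only and is not taken from the elembert repository: the function names (`formula_to_tokens`, `build_vocab`, `encode`, `make_embeddings`, `mean_pool`), the `[PAD]`/`[CLS]` tokens, the random initialization, and the use of plain formulas as input are all assumptions made for this example.

```python
import random
import re

# Hypothetical special tokens; the real model's vocabulary may differ.
SPECIAL_TOKENS = ["[PAD]", "[CLS]"]


def formula_to_tokens(formula):
    """Expand a simple formula into one token per atom, e.g. 'H2O' -> H, H, O.

    Assumption: no parentheses, charges, or fractional counts.
    """
    tokens = []
    for symbol, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        tokens.extend([symbol] * (int(count) if count else 1))
    return tokens


def build_vocab(formulas):
    """Map special tokens and every element symbol seen to an integer id."""
    symbols = sorted({t for f in formulas for t in formula_to_tokens(f)})
    return {tok: i for i, tok in enumerate(SPECIAL_TOKENS + symbols)}


def encode(formula, vocab, max_len):
    """Prefix [CLS], convert tokens to ids, then truncate or pad to max_len."""
    ids = [vocab["[CLS]"]] + [vocab[t] for t in formula_to_tokens(formula)]
    ids = ids[:max_len]
    return ids + [vocab["[PAD]"]] * (max_len - len(ids))


def make_embeddings(vocab, dim, seed=0):
    """Randomly initialized embedding table (in practice these are learned)."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in vocab]


def mean_pool(ids, table, pad_id):
    """Average the embeddings of all non-padding tokens into one vector."""
    vecs = [table[i] for i in ids if i != pad_id]
    return [sum(col) / len(vecs) for col in zip(*vecs)]


vocab = build_vocab(["H2O", "NaCl"])
table = make_embeddings(vocab, dim=4)
ids = encode("H2O", vocab, max_len=6)
compound_vector = mean_pool(ids, table, vocab["[PAD]"])
```

In a trained model the embedding table would be learned jointly with the encoder, and the pooled vector (or the `[CLS]` position) would feed a classification head, for example one output per toxicity assay in a Tox21-style task.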


