Predicting drug properties with parameter-free machine learning: Pareto-Optimal Embedded Modeling (POEM)

02/11/2020
by   Andrew E. Brereton, et al.
0

The prediction of absorption, distribution, metabolism, excretion, and toxicity (ADMET) of small molecules from their molecular structure is a central problem in medicinal chemistry with great practical importance in drug discovery. Creating predictive models conventionally requires substantial trial-and-error for the selection of molecular representations, machine learning (ML) algorithms, and hyperparameter tuning. A generally applicable method that performs well on all datasets without tuning would be of great value but is currently lacking. Here, we describe Pareto-Optimal Embedded Modeling (POEM), a similarity-based method for predicting molecular properties. POEM is a non-parametric, supervised ML algorithm developed to generate reliable predictive models without need for optimization. POEMs predictive strength is obtained by combining multiple different representations of molecular structures in a context-specific manner, while maintaining low dimensionality. We benchmark POEM relative to industry-standard ML algorithms and published results across 17 classifications tasks. POEM performs well in all cases and reduces the risk of overfitting.

READ FULL TEXT

page 16

page 18

page 23

page 29

page 30

research
09/17/2023

Structure to Property: Chemical Element Embeddings and a Deep Learning Approach for Accurate Prediction of Chemical Properties

The application of machine learning (ML) techniques in computational che...
research
03/07/2022

Prediction of transport property via machine learning molecular movements

Molecular dynamics (MD) simulations are increasingly being combined with...
research
02/16/2022

TorchDrug: A Powerful and Flexible Machine Learning Platform for Drug Discovery

Machine learning has huge potential to revolutionize the field of drug d...
research
06/09/2020

GEOM: Energy-annotated molecular conformations for property prediction and molecular generation

Machine learning outperforms traditional approaches in many molecular de...
research
06/01/2022

Graph Machine Learning for Design of High-Octane Fuels

Fuels with high-knock resistance enable modern spark-ignition engines to...
research
06/03/2023

Mitigating Molecular Aggregation in Drug Discovery with Predictive Insights from Explainable AI

As the importance of high-throughput screening (HTS) continues to grow d...
research
05/25/2021

Improving Machine Learning-Based Modeling of Semiconductor Devices by Data Self-Augmentation

In the electronics industry, introducing Machine Learning (ML)-based tec...

Please sign up or login with your details

Forgot password? Click here to reset