Molecular Design Based on Integer Programming and Splitting Data Sets by Hyperplanes

04/27/2023
by   Jianshen Zhu, et al.
0

A novel framework for designing the molecular structure of chemical compounds with a desired chemical property has recently been proposed. The framework infers a desired chemical graph by solving a mixed integer linear program (MILP) that simulates the computation process of a feature function defined by a two-layered model on chemical graphs and a prediction function constructed by a machine learning method. To improve the learning performance of prediction functions in the framework, we design a method that splits a given data set 𝒞 into two subsets 𝒞^(i),i=1,2 by a hyperplane in a chemical space so that most compounds in the first (resp., second) subset have observed values lower (resp., higher) than a threshold θ. We construct a prediction function ψ to the data set 𝒞 by combining prediction functions ψ_i,i=1,2 each of which is constructed on 𝒞^(i) independently. The results of our computational experiments suggest that the proposed method improved the learning performance for several chemical properties to which a good prediction function has been difficult to construct.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/06/2021

An Inverse QSAR Method Based on Linear Regression and Integer Programming

Recently a novel framework has been proposed for designing the molecular...
research
06/02/2023

Chemical Property-Guided Neural Networks for Naphtha Composition Prediction

The naphtha cracking process heavily relies on the composition of naphth...
research
09/27/2022

Machine learning-accelerated chemistry modeling of protoplanetary disks

Aims. With the large amount of molecular emission data from (sub)millime...
research
02/09/2023

ChemVise: Maximizing Out-of-Distribution Chemical Detection with the Novel Application of Zero-Shot Learning

Accurate chemical sensors are vital in medical, military, and home safet...
research
08/23/2021

Molecular Design Based on Artificial Neural Networks, Integer Programming and Grid Neighbor Search

A novel framework has recently been proposed for designing the molecular...
research
05/13/2020

MLSolv-A: A Novel Machine Learning-Based Prediction of Solvation Free Energies from Pairwise Atomistic Interactions

Recent advances in machine learning technologies and their chemical appl...
research
04/16/2017

Deep Learning Based Regression and Multi-class Models for Acute Oral Toxicity Prediction with Automatic Chemical Feature Extraction

For quantitative structure-property relationship (QSPR) studies in chemo...

Please sign up or login with your details

Forgot password? Click here to reset