QSAR Classification Modeling for Bioactivity of Molecular Structure via SPL-Logsum

04/23/2018
by   Liang-Yong Xia, et al.
0

Quantitative structure-activity relationship (QSAR) modelling is effective 'bridge' to search the reliable relationship related bioactivity to molecular structure. A QSAR classification model contains a lager number of redundant, noisy and irrelevant descriptors. To address this problem, various of methods have been proposed for descriptor selection. Generally, they can be grouped into three categories: filters, wrappers, and embedded methods. Regularization method is an important embedded technology, which can be used for continuous shrinkage and automatic descriptors selection. In recent years, the interest of researchers in the application of regularization techniques is increasing in descriptors selection , such as, logistic regression(LR) with L_1 penalty. In this paper, we proposed a novel descriptor selection method based on self-paced learning(SPL) with Logsum penalized LR for predicting the bioactivity of molecular structure. SPL inspired by the learning process of humans and animals that gradually learns from easy samples(smaller losses) to hard samples(bigger losses) samples into training and Logsum regularization has capacity to select few meaningful and significant molecular descriptors, respectively. Experimental results on simulation and three public QSAR datasets show that our proposed SPL-Logsum method outperforms other commonly used sparse methods in terms of classification performance and model interpretation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/23/2018

Descriptor Selection via Self-Paced Learning for Bioactivity of Molecular Structure in QSAR Classification

Quantitative structure-activity relationship (QSAR) modelling is effecti...
research
03/25/2021

Quantitative Prediction on the Enantioselectivity of Multiple Chiral Iodoarene Scaffolds Based on Whole Geometry

The mechanistic underpinnings of asymmetric catalysis at atomic levels p...
research
02/21/2014

Important Molecular Descriptors Selection Using Self Tuned Reweighted Sampling Method for Prediction of Antituberculosis Activity

In this paper, a new descriptor selection method for selecting an optima...
research
04/10/2019

Classification of signaling proteins based on molecular star graph descriptors using Machine Learning models

Signaling proteins are an important topic in drug development due to the...
research
10/20/2022

A Methodology for the Prediction of Drug Target Interaction using CDK Descriptors

Detecting probable Drug Target Interaction (DTI) is a critical task in d...
research
04/03/2023

Development and Evaluation of Conformal Prediction Methods for QSAR

The quantitative structure-activity relationship (QSAR) regression model...

Please sign up or login with your details

Forgot password? Click here to reset