Balancing Predictive Relevance of Ligand Biochemical Activities

04/06/2021
by   Marek Pecha, et al.
0

In this paper, we present a technique for balancing predictive relevance models related to supervised modelling ligand biochemical activities to biological targets. We train uncalibrated models employing conventional supervised machine learning technique, namely Support Vector Machines. Unfortunately, SVMs have a serious drawback. They are sensitive to imbalanced datasets, outliers and high multicollinearity among training samples, which could be a cause of preferencing one group over another. Thus, an additional calibration could be required for balancing a predictive relevance of models. As a technique for this balancing, we propose the Platt's scaling. The achieved results were demonstrated on single-target models trained on datasets exported from the ExCAPE database. Unlike traditional used machine techniques, we focus on decreasing uncertainty employing deterministic solvers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/16/2016

Algebraic multigrid support vector machines

The support vector machine is a flexible optimization-based technique wi...
research
06/30/2023

The Effect of Balancing Methods on Model Behavior in Imbalanced Classification Problems

Imbalanced data poses a significant challenge in classification as model...
research
08/07/2020

A Technique for Determining Relevance Scores of Process Activities using Graph-based Neural Networks

Process models generated through process mining depict the as-is state o...
research
01/12/2022

Careful! Training Relevance is Real

There is a recent proliferation of research on the integration of machin...
research
02/15/2018

Simulation assisted machine learning

Predicting how a proposed cancer treatment will affect a given tumor can...
research
10/17/2019

Ranking variables and interactions using predictive uncertainty measures

For complex nonlinear supervised learning models, assessing the relevanc...
research
01/21/2022

To SMOTE, or not to SMOTE?

In imbalanced binary classification problems the objective metric is oft...

Please sign up or login with your details

Forgot password? Click here to reset