Random forest models of the retention constants in the thin layer chromatography

06/16/2011
by   Miron B. Kursa, et al.
0

In the current study we examine an application of the machine learning methods to model the retention constants in the thin layer chromatography (TLC). This problem can be described with hundreds or even thousands of descriptors relevant to various molecular properties, most of them redundant and not relevant for the retention constant prediction. Hence we employed feature selection to significantly reduce the number of attributes. Additionally we have tested application of the bagging procedure to the feature selection. The random forest regression models were built using selected variables. The resulting models have better correlation with the experimental data than the reference models obtained with linear regression. The cross-validation confirms robustness of the models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/25/2011

The All Relevant Feature Selection using Random Forest

In this paper we examine the application of the random forest classifier...
research
03/04/2023

Integration of Feature Selection Techniques using a Sleep Quality Dataset for Comparing Regression Algorithms

This research aims to examine the usefulness of integrating various feat...
research
04/28/2020

Analysis of ensemble feature selection for correlated high-dimensional RNA-Seq cancer data

Discovery of diagnostic and prognostic molecular markers is important an...
research
11/07/2020

Machine learning applications to DNA subsequence and restriction site analysis

Based on the BioBricks standard, restriction synthesis is a novel catabo...
research
10/13/2018

MaaSim: A Liveability Simulation for Improving the Quality of Life in Cities

Urbanism is no longer planned on paper thanks to powerful models and 3D ...
research
12/27/2022

Feature Selection Approaches for Optimising Music Emotion Recognition Methods

The high feature dimensionality is a challenge in music emotion recognit...
research
04/01/2020

Sequential Feature Classification in the Context of Redundancies

The problem of all-relevant feature selection is concerned with finding ...

Please sign up or login with your details

Forgot password? Click here to reset