RAFP-Pred: Robust Prediction of Antifreeze Proteins using Localized Analysis of n-Peptide Compositions

09/25/2018
by   Shujaat Khan, et al.
0

In extreme cold weather, living organisms produce Antifreeze Proteins (AFPs) to counter the otherwise lethal intracellular formation of ice. Structures and sequences of various AFPs exhibit a high degree of heterogeneity, consequently the prediction of the AFPs is considered to be a challenging task. In this research, we propose to handle this arduous manifold learning task using the notion of localized processing. In particular an AFP sequence is segmented into two sub-segments each of which is analyzed for amino acid and di-peptide compositions. We propose to use only the most significant features using the concept of information gain (IG) followed by a random forest classification approach. The proposed RAFP-Pred achieved an excellent performance on a number of standard datasets. We report a high Youden's index (sensitivity+specificity-1) value of 0.75 on the standard independent test data set outperforming the AFP-PseAAC, AFP_PSSM, AFP-Pred and iAFP by a margin of 0.05, 0.06, 0.14 and 0.68 respectively. The verification rate on the UniProKB dataset is found to be 83.19% which is substantially superior to the 57.18% reported for the iAFP method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/13/2020

Automation of Hemocompatibility Analysis Using Image Segmentation and a Random Forest

The hemocompatibility of blood-contacting medical devices remains one of...
research
03/29/2022

Explaining random forest prediction through diverse rulesets

Tree-ensemble algorithms, such as random forest, are effective machine l...
research
02/25/2021

Random Forest based Qantile Oriented Sensitivity Analysis indices estimation

We propose a random forest based estimation procedure for Quantile Orien...
research
01/12/2022

Predicting Terrorist Attacks in the United States using Localized News Data

Terrorism is a major problem worldwide, causing thousands of fatalities ...
research
06/01/2018

Spatially Localized Atlas Network Tiles Enables 3D Whole Brain Segmentation from Limited Data

Whole brain segmentation on a structural magnetic resonance imaging (MRI...
research
04/02/2021

IITK@LCP at SemEval 2021 Task 1: Classification for Lexical Complexity Regression Task

This paper describes our contribution to SemEval 2021 Task 1: Lexical Co...

Please sign up or login with your details

Forgot password? Click here to reset