SBSM-Pro: Support Bio-sequence Machine for Proteins

08/20/2023
by Yizheng Wang, et al.

Proteins play a pivotal role in biological systems. Machine learning algorithms for protein classification can assist and even guide biological experiments, offering crucial insights for biotechnological applications. We propose SBSM-Pro, a support bio-sequence machine for proteins, a model specifically designed for biological sequence classification. The model starts from raw sequences and groups amino acids according to their physicochemical properties. It uses sequence alignment to measure the similarity between proteins, integrates the different types of information with a novel multiple kernel learning (MKL) approach, and performs classification with support vector machines. The results indicate that the model performs well across 10 datasets covering the identification of protein function and posttranslational modification. This research not only showcases state-of-the-art work in protein classification but also opens new directions in this domain, and it is a useful step towards platforms tailored for biological sequence classification. SBSM-Pro is available at http://lab.malab.cn/soft/SBSM-Pro/.
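The pipeline described in the abstract (group amino acids, score pairs of proteins by alignment, combine several similarity matrices) can be illustrated with a minimal sketch. It is not the SBSM-Pro implementation: the two amino acid groupings, the scoring parameters, and every function name below are assumptions made for illustration, and a fixed weighted sum stands in for the paper's MKL approach, whose details the abstract does not give.

```python
# Illustrative sketch only; groupings, scores and weights are assumptions,
# not the values used by SBSM-Pro.

# Hypothetical reduced alphabets based on physicochemical properties.
HYDROPATHY_GROUPS = {
    "hydrophobic": "AVLIMFWC",
    "polar": "STNQYGP",
    "charged": "DEKRH",
}
CHARGE_GROUPS = {
    "positive": "KRH",
    "negative": "DE",
    "neutral": "AVLIMFWCSTNQYGP",
}


def build_encoder(groups):
    """Map each amino acid letter to a one-character code for its group."""
    encoder = {}
    for code, residues in zip("abcdefghij", groups.values()):
        for aa in residues:
            encoder[aa] = code
    return encoder


def reduce_sequence(seq, encoder):
    """Rewrite a sequence in the reduced alphabet; unknown residues become 'x'."""
    return "".join(encoder.get(aa, "x") for aa in seq.upper())


def alignment_score(a, b, match=1, mismatch=-1, gap=-1):
    """Needleman-Wunsch global alignment score with a linear gap penalty."""
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


def similarity_matrix(seqs, encoder):
    """Pairwise alignment scores, normalised so every self-similarity is 1.

    Assumes non-empty sequences. Alignment scores are not guaranteed to
    form a positive semidefinite kernel; this sketch does not correct that.
    """
    reduced = [reduce_sequence(s, encoder) for s in seqs]
    n = len(reduced)
    raw = [[alignment_score(reduced[i], reduced[j]) for j in range(n)]
           for i in range(n)]
    return [[raw[i][j] / (raw[i][i] * raw[j][j]) ** 0.5 for j in range(n)]
            for i in range(n)]


def combine_kernels(kernels, weights):
    """Fixed weighted sum of similarity matrices (a stand-in for learned MKL weights)."""
    n = len(kernels[0])
    return [[sum(w * k[i][j] for k, w in zip(kernels, weights))
             for j in range(n)] for i in range(n)]


seqs = ["AVLKDE", "AVLKDE", "STNQYG"]
k_hydropathy = similarity_matrix(seqs, build_encoder(HYDROPATHY_GROUPS))
k_charge = similarity_matrix(seqs, build_encoder(CHARGE_GROUPS))
combined = combine_kernels([k_hydropathy, k_charge], [0.5, 0.5])
```

A matrix such as `combined` could then be passed to any SVM implementation that accepts a precomputed kernel, for example scikit-learn's `SVC(kernel="precomputed")`. In a real MKL setting the weights would be learned from the training labels rather than fixed by hand.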


