A SVM Model for Candidate Y-chromosome Gene Discovery in Prostate Cancer

03/06/2022
by   Dulani Meedeniya, et al.
0

Prostate cancer is widely known to be one of the most common cancers among men around the world. Due to its high heterogeneity, many of the studies carried out to identify the molecular level causes for cancer have only been partially successful. Among the techniques used in cancer studies, gene expression profiling is seen to be one of the most popular techniques due to its high usage. Gene expression profiles reveal information about the functionality of genes in different body tissues at different conditions. In order to identify cancer-decisive genes, differential gene expression analysis is carried out using statistical and machine learning methodologies. It helps to extract information about genes that have significant expression differences between healthy tissues and cancerous tissues. In this paper, we discuss a comprehensive supervised classification approach using Support Vector Machine (SVM) models to investigate differentially expressed Y-chromosome genes in prostate cancer. 8 SVM models, which are tuned to have 98.3% average accuracy have been used for the analysis. We were able to capture genes like CD99 (MIC2), ASMTL, DDX3Y and TXLNGY to come out as the best candidates. Some of our results support existing findings while introducing novel findings to be possible prostate cancer candidates.

READ FULL TEXT

page 7

page 8

research
07/12/2012

Biogeography-Based Informative Gene Selection and Cancer Classification Using SVM and Random Forests

Microarray cancer gene expression data comprise of very high dimensions....
research
05/02/2018

Prediction of a Gene Regulatory Network from Gene Expression Profiles With Linear Regression and Pearson Correlation Coefficient

Reconstruction of gene regulatory networks is the process of identifying...
research
11/10/2017

A Novel Bayesian Multiple Testing Approach to Deregulated miRNA Discovery Harnessing Positional Clustering

MicroRNAs (miRNAs) are endogenous, small non-coding RNAs that function a...
research
04/06/2020

Breast and Colon Cancer Classification from Gene Expression Profiles Using Data Mining Techniques

Early detection of cancer increases the probability of recovery. This pa...
research
04/24/2019

Using Machine Learning and Natural Language Processing to Review and Classify the Medical Literature on Cancer Susceptibility Genes

PURPOSE: The medical literature relevant to germline genetics is growing...
research
08/29/2022

Attention-based Interpretable Regression of Gene Expression in Histology

Interpretability of deep learning is widely used to evaluate the reliabi...
research
11/30/2021

SurvODE: Extrapolating Gene Expression Distribution for Early Cancer Identification

With the increasingly available large-scale cancer genomics datasets, ma...

Please sign up or login with your details

Forgot password? Click here to reset