Multiclass Disease Predictions Based on Integrated Clinical and Genomics Datasets

06/14/2020
by Moeez M. Subhani, et al.

Computational methods that make clinical predictions from clinical data are common in bioinformatics. Predictions that also draw on genomics datasets are far less common. Precision medicine research, however, needs information from all available datasets to provide intelligent clinical solutions. In this paper, we build a prediction model that uses information from both clinical and genomics datasets, and we demonstrate multiclass disease prediction on the combined data using machine learning methods. We created an integrated dataset from a clinical dataset (ClinVar) and a genomics dataset (gene expression), and trained an instance-based learner on it to predict clinical diseases. We used an innovative but simple approach to multiclass classification, in which the number of output classes is as high as 75. We used Principal Component Analysis for feature selection. The classifier predicted diseases with 73% accuracy on the integrated dataset. The results were consistent and competitive when compared with other classification models. They show that genomics information can be reliably included in datasets for clinical predictions, and that it can be valuable in clinical diagnostics and precision medicine.
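The pipeline described in the abstract (integrate clinical and genomics features, reduce them with Principal Component Analysis, classify with an instance-based learner) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes NumPy, uses purely synthetic data in place of ClinVar and gene expression records, and takes k-nearest neighbours as one example of an instance-based learner. All function names, the 15 retained components, and k=3 are hypothetical choices made for illustration only.

```python
import numpy as np


def pca_fit(X, n_components):
    """Return the training mean and the top principal axes (one per row)."""
    mean = X.mean(axis=0)
    # SVD of the centered data: rows of vt are the principal axes,
    # ordered by decreasing explained variance.
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_components]


def pca_transform(X, mean, components):
    """Project X onto the principal axes learned from the training set."""
    return (X - mean) @ components.T


def knn_predict(X_train, y_train, X_test, k=1):
    """Instance-based prediction: majority vote of the k nearest neighbours."""
    # Pairwise squared Euclidean distances, shape (n_test, n_train).
    d2 = ((X_test[:, None, :] - X_train[None, :, :]) ** 2).sum(axis=2)
    nearest = np.argsort(d2, axis=1)[:, :k]
    votes = y_train[nearest]
    return np.array([np.bincount(row).argmax() for row in votes])


def make_synthetic(n_classes=75, per_class=20, n_clinical=10,
                   n_genomic=40, seed=0):
    """Synthetic stand-in for the two data sources (assumption, not real data)."""
    rng = np.random.default_rng(seed)
    n_features = n_clinical + n_genomic
    centers = rng.normal(scale=5.0, size=(n_classes, n_features))
    y = np.repeat(np.arange(n_classes), per_class)
    X = centers[y] + rng.normal(size=(y.size, n_features))
    return X[:, :n_clinical], X[:, n_clinical:], y


def run_demo(n_components=15, k=3):
    clinical, genomic, y = make_synthetic()
    # "Integration" here is a column-wise join of rows that are already aligned.
    X = np.hstack([clinical, genomic])

    # Hold out 20% of the samples for evaluation.
    idx = np.random.default_rng(1).permutation(len(y))
    split = int(0.8 * len(y))
    train, test = idx[:split], idx[split:]

    # Fit PCA on the training rows only, then project both splits.
    mean, comps = pca_fit(X[train], n_components)
    Z_train = pca_transform(X[train], mean, comps)
    Z_test = pca_transform(X[test], mean, comps)

    pred = knn_predict(Z_train, y[train], Z_test, k=k)
    return float((pred == y[test]).mean())


if __name__ == "__main__":
    print(f"held-out accuracy on synthetic data: {run_demo():.3f}")
```

The accuracy printed by this sketch reflects only the synthetic data and says nothing about the 73% reported above. Fitting PCA on the training rows alone keeps information from the held-out samples out of the projection.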

