Using PCA and Factor Analysis for Dimensionality Reduction of Bio-informatics Data

07/22/2017
by   M. Usman Ali, et al.
0

Large volume of Genomics data is produced on daily basis due to the advancement in sequencing technology. This data is of no value if it is not properly analysed. Different kinds of analytics are required to extract useful information from this raw data. Classification, Prediction, Clustering and Pattern Extraction are useful techniques of data mining. These techniques require appropriate selection of attributes of data for getting accurate results. However, Bioinformatics data is high dimensional, usually having hundreds of attributes. Such large a number of attributes affect the performance of machine learning algorithms used for classification/prediction. So, dimensionality reduction techniques are required to reduce the number of attributes that can be further used for analysis. In this paper, Principal Component Analysis and Factor Analysis are used for dimensionality reduction of Bioinformatics data. These techniques were applied on Leukaemia data set and the number of attributes was reduced from to.

READ FULL TEXT

page 5

page 8

research
01/09/2020

Supervised Discriminative Sparse PCA with Adaptive Neighbors for Dimensionality Reduction

Dimensionality reduction is an important operation in information visual...
research
08/08/2018

Feature Dimensionality Reduction for Video Affect Classification: A Comparative Study

Affective computing has become a very important research area in human-m...
research
12/30/2021

Dimensionality reduction for prediction: Application to Bitcoin and Ethereum

The objective of this paper is to assess the performances of dimensional...
research
07/10/2020

Numerical simulation, clustering and prediction of multi-component polymer precipitation

Multi-component polymer systems are of interest in organic photovoltaic ...
research
05/21/2016

Learning From Hidden Traits: Joint Factor Analysis and Latent Clustering

Dimensionality reduction techniques play an essential role in data analy...
research
08/01/2017

DROP: Dimensionality Reduction Optimization for Time Series

Dimensionality reduction is critical in analyzing increasingly high-volu...
research
05/06/2022

Application of Clustering Algorithms for Dimensionality Reduction in Infrastructure Resilience Prediction Models

Recent studies increasingly adopt simulation-based machine learning (ML)...

Please sign up or login with your details

Forgot password? Click here to reset