An Analysis of Gene Expression Data using Penalized Fuzzy C-Means Approach

01/08/2013
by   P. K. Nizar Banu, et al.
0

With the rapid advances of microarray technologies, large amounts of high-dimensional gene expression data are being generated, which poses significant computational challenges. A first step towards addressing this challenge is the use of clustering techniques, which is essential in the data mining process to reveal natural structures and identify interesting patterns in the underlying data. A robust gene expression clustering approach to minimize undesirable clustering is proposed. In this paper, Penalized Fuzzy C-Means (PFCM) Clustering algorithm is described and compared with the most representative off-line clustering techniques: K-Means Clustering, Rough K-Means Clustering and Fuzzy C-Means clustering. These techniques are implemented and tested for a Brain Tumor gene expression Dataset. Analysis of the performance of the proposed approach is presented through qualitative validation experiments. From experimental results, it can be observed that Penalized Fuzzy C-Means algorithm shows a much higher usability than the other projected clustering algorithms used in our comparison study. Significant and promising clustering results are presented using Brain Tumor Gene expression dataset. Thus patterns seen in genome-wide expression experiments can be interpreted as indications of the status of cellular processes. In these clustering results, we find that Penalized Fuzzy C-Means algorithm provides useful information as an aid to diagnosis in oncology.

READ FULL TEXT
research
01/01/2021

Interval Type-2 Enhanced Possibilistic Fuzzy C-Means Clustering for Gene Expression Data Analysis

Both FCM and PCM clustering methods have been widely applied to pattern ...
research
02/03/2023

A Novel Fuzzy Bi-Clustering Algorithm with AFS for Identification of Co-Regulated Genes

The identification of co-regulated genes and their transcription-factor ...
research
03/26/2020

A New Gene Selection Algorithm using Fuzzy-Rough Set Theory for Tumor Classification

In statistics and machine learning, feature selection is the process of ...
research
08/18/2020

EXCLUVIS: A MATLAB GUI Software for Comparative Study of Clustering and Visualization of Gene Expression Data

Clustering is a popular data mining technique that aims to partition an ...
research
05/19/2015

Modelling-based experiment retrieval: A case study with gene expression clustering

Motivation: Public and private repositories of experimental data are gro...
research
07/01/2021

Feasibility of Haralick's Texture Features for the Classification of Chromogenic In-situ Hybridization Images

This paper presents a proof of concept for the usefulness of second-orde...
research
08/25/2022

PRIME: Uncovering Circadian Oscillation Patterns and Associations with AD in Untimed Genome-wide Gene Expression across Multiple Brain Regions

The disruption of circadian rhythm is a cardinal symptom for Alzheimer's...

Please sign up or login with your details

Forgot password? Click here to reset