EXCLUVIS: A MATLAB GUI Software for Comparative Study of Clustering and Visualization of Gene Expression Data

08/18/2020
by   Sudip Poddar, et al.
0

Clustering is a popular data mining technique that aims to partition an input space into multiple homogeneous regions. There exist several clustering algorithms in the literature. The performance of a clustering algorithm depends on its input parameters which can substantially affect the behavior of the algorithm. Cluster validity indices determine the partitioning that best fits the underlying data. In bioinformatics, microarray gene expression technology has made it possible to measure the gene expression levels of thousands of genes simultaneously. Many genomic studies, which aim to analyze the functions of some genes, highly rely on some clustering technique for grouping similarly expressed genes in one cluster or partitioning tissue samples based on similar expression values of genes. In this work, an application package called EXCLUVIS (gene EXpression data CLUstering and VISualization) has been developed using MATLAB Graphical User Interface (GUI) environment for analyzing the performances of different clustering algorithms on gene expression datasets. In this application package, the user needs to select a number of parameters such as internal validity indices, external validity indices and number of clusters from the active windows for evaluating the performance of the clustering algorithms. EXCLUVIS compares the performances of K-means, fuzzy C-means, hierarchical clustering and multiobjective evolutionary clustering algorithms. Heatmap and cluster profile plots are used for visualizing the results. EXCLUVIS allows the users to easily find the goodness of clustering solutions as well as provides visual representations of the clustering outcomes.

READ FULL TEXT

page 9

page 10

page 11

page 12

page 13

page 14

page 16

page 17

research
05/19/2020

A New Validity Index for Fuzzy-Possibilistic C-Means Clustering

In some complicated datasets, due to the presence of noisy data points a...
research
07/26/2020

BIDEAL: A Toolbox for Bicluster Analysis – Generation, Visualization and Validation

This paper introduces a novel toolbox named BIDEAL for the generation of...
research
03/14/2021

Pandemonium: a clustering tool to partition parameter space – application to the B anomalies

We introduce the interactive tool pandemonium to cluster model predictio...
research
04/24/2023

Unsupervised Machine Learning to Classify the Confinement of Waves in Periodic Superstructures

We employ unsupervised machine learning to enhance the accuracy of our r...
research
01/08/2013

An Analysis of Gene Expression Data using Penalized Fuzzy C-Means Approach

With the rapid advances of microarray technologies, large amounts of hig...
research
11/03/2016

A-Ward_pe̱ṯa̱: Effective hierarchical clustering using the Minkowski metric and a fast k -means initialisation

In this paper we make two novel contributions to hierarchical clustering...
research
05/19/2015

Modelling-based experiment retrieval: A case study with gene expression clustering

Motivation: Public and private repositories of experimental data are gro...

Please sign up or login with your details

Forgot password? Click here to reset