An Enhanced Adaptive Bi-clustering Algorithm through Building a Shielding Complex Sub-Matrix

11/12/2021
by   Kaijie Xu, et al.
0

Bi-clustering refers to the task of finding sub-matrices (indexed by a group of columns and a group of rows) within a matrix of data such that the elements of each sub-matrix (data and features) are related in a particular way, for instance, that they are similar with respect to some metric. In this paper, after analyzing the well-known Cheng and Church (CC) bi-clustering algorithm which has been proved to be an effective tool for mining co-expressed genes. However, Cheng and Church bi-clustering algorithm and summarizing its limitations (such as interference of random numbers in the greedy strategy; ignoring overlapping bi-clusters), we propose a novel enhancement of the adaptive bi-clustering algorithm, where a shielding complex sub-matrix is constructed to shield the bi-clusters that have been obtained and to discover the overlapping bi-clusters. In the shielding complex sub-matrix, the imaginary and the real parts are used to shield and extend the new bi-clusters, respectively, and to form a series of optimal bi-clusters. To assure that the obtained bi-clusters have no effect on the bi-clusters already produced, a unit impulse signal is introduced to adaptively detect and shield the constructed bi-clusters. Meanwhile, to effectively shield the null data (zero-size data), another unit impulse signal is set for adaptive detecting and shielding. In addition, we add a shielding factor to adjust the mean squared residue score of the rows (or columns), which contains the shielded data of the sub-matrix, to decide whether to retain them or not. We offer a thorough analysis of the developed scheme. The experimental results are in agreement with the theoretical analysis. The results obtained on a publicly available real microarray dataset show the enhancement of the bi-clusters performance thanks to the proposed method.

READ FULL TEXT

page 1

page 4

research
02/03/2023

A Novel Fuzzy Bi-Clustering Algorithm with AFS for Identification of Co-Regulated Genes

The identification of co-regulated genes and their transcription-factor ...
research
05/12/2020

A Novel Granular-Based Bi-Clustering Method of Deep Mining the Co-Expressed Genes

Traditional clustering methods are limited when dealing with huge and he...
research
02/09/2020

Bi-objective Optimization of Biclustering with Binary Data

Clustering consists of partitioning data objects into subsets called clu...
research
07/14/2023

Visualizing Overlapping Biclusterings and Boolean Matrix Factorizations

Finding (bi-)clusters in bipartite graphs is a popular data analysis app...
research
04/13/2009

KiWi: A Scalable Subspace Clustering Algorithm for Gene Expression Analysis

Subspace clustering has gained increasing popularity in the analysis of ...
research
05/25/2018

COREclust: a new package for a robust and scalable analysis of complex data

In this paper, we present a new R package COREclust dedicated to the det...
research
09/02/2020

Combining Determinism and Indeterminism

Our goal is to construct mathematical operations that combine indetermin...

Please sign up or login with your details

Forgot password? Click here to reset