Biclustering with Alternating K-Means

09/09/2020
by   Nicolas Fraiman, et al.
12

Biclustering is the task of simultaneously clustering the rows and columns of the data matrix into different subgroups such that the rows and columns within a subgroup exhibit similar patterns. In this paper, we consider the case of producing exclusive row and column biclusters. We provide a new formulation of the biclustering problem based on the idea of minimizing the empirical clustering risk. We develop and prove a consistency result with respect to the empirical clustering risk. Since the optimization problem is combinatorial in nature, finding the global minimum is computationally intractable. In light of this fact, we propose a simple and novel algorithm that finds a local minimum by alternating the use of an adapted version of the k-means clustering algorithm between columns and rows. We evaluate and compare the performance of our algorithm to other related biclustering methods on both simulated data and real-world gene expression data sets. The results demonstrate that our algorithm is able to detect meaningful structures in the data and outperform other competing biclustering methods in various settings and situations.

READ FULL TEXT
research
09/21/2021

Classification with Nearest Disjoint Centroids

In this paper, we develop a new classification method based on nearest c...
research
09/13/2018

MSc Dissertation: Exclusive Row Biclustering for Gene Expression Using a Combinatorial Auction Approach

The availability of large microarray data has led to a growing interest ...
research
03/29/2021

Automatic Clustering in Hyrise

Physical data layout is an important performance factor for modern datab...
research
06/29/2012

Consistent Biclustering

Biclustering, the process of simultaneously clustering the rows and colu...
research
10/16/2018

Co-manifold learning with missing data

Representation learning is typically applied to only one mode of a data ...
research
02/08/2020

Conjoined Dirichlet Process

Biclustering is a class of techniques that simultaneously clusters the r...
research
06/27/2012

Matrix Tile Analysis

Many tasks require finding groups of elements in a matrix of numbers, sy...

Please sign up or login with your details

Forgot password? Click here to reset