Visualizing Overlapping Biclusterings and Boolean Matrix Factorizations

07/14/2023
by   Thibault Marette, et al.

Finding (bi-)clusters in bipartite graphs is a popular data analysis approach. Analysts typically want to visualize the clusters, which is simple as long as the clusters are disjoint. However, many modern algorithms find overlapping clusters, making visualization more complicated. In this paper, we study the problem of visualizing a given clustering of overlapping clusters in bipartite graphs and the related problem of visualizing Boolean Matrix Factorizations. We conceptualize three different objectives that any good visualization should satisfy: (1) proximity of cluster elements, (2) large consecutive areas of elements from the same cluster, and (3) large uninterrupted areas in the visualization, regardless of the cluster membership. We provide objective functions that capture these goals and algorithms that optimize these objective functions. Interestingly, in experiments on real-world datasets, we find that the best trade-off between these competing goals is achieved by a novel heuristic, which locally aims to place rows and columns with similar cluster membership next to each other.
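To make the closing idea of the abstract concrete, here is a minimal sketch of a local ordering heuristic that places rows with similar cluster membership next to each other. It is not the authors' algorithm: the greedy nearest-neighbor strategy, the Jaccard similarity measure, and the names `jaccard` and `greedy_order` are all assumptions chosen for illustration. The same idea would apply to columns.

```python
from typing import List, Set


def jaccard(a: Set[int], b: Set[int]) -> float:
    """Jaccard similarity of two cluster-membership sets (assumed measure)."""
    if not a and not b:
        return 1.0  # two rows that belong to no cluster count as identical
    return len(a & b) / len(a | b)


def greedy_order(memberships: List[Set[int]]) -> List[int]:
    """Order rows so that each row follows the most similar unplaced row.

    memberships[i] is the set of cluster ids that row i belongs to.
    Hypothetical illustration only, not the method from the paper.
    """
    if not memberships:
        return []
    order = [0]  # arbitrary start: the first row
    remaining = set(range(1, len(memberships)))
    while remaining:
        last = memberships[order[-1]]
        # Most similar remaining row; ties go to the smallest row index.
        nxt = max(sorted(remaining), key=lambda i: jaccard(last, memberships[i]))
        order.append(nxt)
        remaining.remove(nxt)
    return order


rows = [{0}, {1}, {0, 1}, {0}, set()]
print(greedy_order(rows))  # [0, 3, 2, 1, 4]
```

In this toy input, the two rows that belong only to cluster 0 end up adjacent, followed by the row in the overlap of clusters 0 and 1, then the row that belongs only to cluster 1. A greedy pass like this only looks at the previously placed row, so it says nothing about how well the global objectives in the abstract are met.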


research
04/24/2020

Non-Exhaustive, Overlapping Co-Clustering: An Extended Analysis

The goal of co-clustering is to simultaneously identify a clustering of ...
research
05/14/2018

Algorithms and Complexity of Range Clustering

We introduce a novel criterion in clustering that seeks clusters with li...
research
10/28/2019

Same-Cluster Querying for Overlapping Clusters

Overlapping clusters are common in models of many practical data-segment...
research
01/27/2020

A Proposed Method for Assessing Cluster Heterogeneity

Assessing how adequate clusters fit a dataset and finding an optimum num...
research
11/12/2021

An Enhanced Adaptive Bi-clustering Algorithm through Building a Shielding Complex Sub-Matrix

Bi-clustering refers to the task of finding sub-matrices (indexed by a g...
research
08/18/2021

Stochastic Cluster Embedding

Neighbor Embedding (NE) that aims to preserve pairwise similarities betw...
research
03/09/2016

Bipartite Correlation Clustering -- Maximizing Agreements

In Bipartite Correlation Clustering (BCC) we are given a complete bipart...
