Clustering Optimisation Method for Highly Connected Biological Data

08/08/2022
by Richard Tjörnhammar, et al.

Data-driven discovery in the biological sciences currently relies on finding segmentation strategies for multivariate data that produce sensible descriptions of the data. Clustering is only one of several approaches, and it sometimes falls short because it is difficult to choose reasonable cutoffs or the number of clusters to form, or because the approach fails to preserve topological properties of the original system in its clustered form. In this work, we show how a simple metric for evaluating connectivity clustering leads to an optimised segmentation of biological data. The novelty of the work lies in a simple optimisation method for clustering crowded data. The resulting clustering approach relies only on metrics derived from the inherent properties of the clustering. The new method is easy to implement and provides the knowledge needed for optimised clustering. We discuss how the clustering optimisation strategy corresponds to the viable information content yielded by the final segmentation. We further elaborate on how the clustering results, in the optimal solution, correspond to prior knowledge of three different data sets.
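To make the cutoff problem mentioned in the abstract concrete, the following is a minimal sketch of connectivity clustering on toy data. It is not the authors' method or their evaluation metric, neither of which is specified here. It assumes that two points are connected when their Euclidean distance is at most a cutoff, and that clusters are the connected components of the resulting graph. The names `connectivity_clusters` and `scan_cutoffs`, and the toy points, are hypothetical.

```python
from itertools import combinations
from math import dist


def connectivity_clusters(points, cutoff):
    """Label each point by its connected component.

    Assumption: two points are linked when their Euclidean
    distance is at most `cutoff`.
    """
    parent = list(range(len(points)))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(points)), 2):
        if dist(points[i], points[j]) <= cutoff:
            root_i, root_j = find(i), find(j)
            if root_i != root_j:
                parent[root_j] = root_i

    # Relabel roots as 0..k-1 in order of first appearance.
    labels = {}
    return [labels.setdefault(find(i), len(labels)) for i in range(len(points))]


def scan_cutoffs(points, cutoffs):
    """Return (cutoff, number of clusters) for each candidate cutoff."""
    return [(c, len(set(connectivity_clusters(points, c)))) for c in cutoffs]


# Hypothetical toy data: two tight groups and one isolated point.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (5.0, 5.0), (5.0, 6.0), (10.0, 0.0)]

for cutoff, n_clusters in scan_cutoffs(points, [0.5, 1.0, 1.5, 8.0]):
    print(cutoff, n_clusters)
```

The scan shows that the number of clusters depends entirely on the chosen cutoff, which is why a criterion derived from the clustering itself is needed to pick one.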

