ExClus: Explainable Clustering on Low-dimensional Data Representations

11/04/2021
by   Xander Vankwikelberge, et al.
0

Dimensionality reduction and clustering techniques are frequently used to analyze complex data sets, but their results are often not easy to interpret. We consider how to support users in interpreting apparent cluster structure on scatter plots where the axes are not directly interpretable, such as when the data is projected onto a two-dimensional space using a dimensionality-reduction method. Specifically, we propose a new method to compute an interpretable clustering automatically, where the explanation is in the original high-dimensional space and the clustering is coherent in the low-dimensional projection. It provides a tunable balance between the complexity and the amount of information provided, through the use of information theory. We study the computational complexity of this problem and introduce restrictions on the search space of solutions to arrive at an efficient, tunable, greedy optimization algorithm. This algorithm is furthermore implemented in an interactive tool called ExClus. Experiments on several data sets highlight that ExClus can provide informative and easy-to-understand patterns, and they expose where the algorithm is efficient and where there is room for improvement considering tunability and scalability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/25/2022

Laplacian-based Cluster-Contractive t-SNE for High Dimensional Data Visualization

Dimensionality reduction techniques aim at representing high-dimensional...
research
03/23/2023

Clustering based on Mixtures of Sparse Gaussian Processes

Creating low dimensional representations of a high dimensional data set ...
research
09/06/2019

Solving Interpretable Kernel Dimension Reduction

Kernel dimensionality reduction (KDR) algorithms find a low dimensional ...
research
03/26/2023

Interpretable Linear Dimensionality Reduction based on Bias-Variance Analysis

One of the central issues of several machine learning applications on re...
research
01/20/2020

Exploring Visual Patterns in Projected Human and Machine Decision-Making Paths

In problem solving, the paths towards solutions can be viewed as a seque...
research
03/14/2022

Accelerating Plug-and-Play Image Reconstruction via Multi-Stage Sketched Gradients

In this work we propose a new paradigm for designing fast plug-and-play ...
research
01/29/2019

Throttling Malware Families in 2D

Malicious software are categorized into families based on their static a...

Please sign up or login with your details

Forgot password? Click here to reset