ZADU: A Python Library for Evaluating the Reliability of Dimensionality Reduction Embeddings

08/01/2023
by   Hyeon Jeon, et al.
0

Dimensionality reduction (DR) techniques inherently distort the original structure of input high-dimensional data, producing imperfect low-dimensional embeddings. Diverse distortion measures have thus been proposed to evaluate the reliability of DR embeddings. However, implementing and executing distortion measures in practice has so far been time-consuming and tedious. To address this issue, we present ZADU, a Python library that provides distortion measures. ZADU is not only easy to install and execute but also enables comprehensive evaluation of DR embeddings through three key features. First, the library covers a wide range of distortion measures. Second, it automatically optimizes the execution of distortion measures, substantially reducing the running time required to execute multiple measures. Last, the library informs how individual points contribute to the overall distortions, facilitating the detailed analysis of DR embeddings. By simulating a real-world scenario of optimizing DR embeddings, we verify that our optimization scheme substantially reduces the time required to execute distortion measures. Finally, as an application of ZADU, we present another library called ZADUVis that allows users to easily create distortion visualizations that depict the extent to which each region of an embedding suffers from distortions.

READ FULL TEXT
research
08/01/2023

Classes are not Clusters: Improving Label-based Evaluation of Dimensionality Reduction

A common way to evaluate the reliability of dimensionality reduction (DR...
research
06/18/2017

Dimensionality Reduction using Similarity-induced Embeddings

The vast majority of Dimensionality Reduction (DR) techniques rely on se...
research
08/02/2020

A Visual Analytics Framework for Reviewing Multivariate Time-Series Data with Dimensionality Reduction

Data-driven problem solving in many real-world applications involves ana...
research
03/03/2021

Minimum-Distortion Embedding

We consider the vector embedding problem. We are given a finite set of i...
research
04/15/2023

Dimensionality Reduction as Probabilistic Inference

Dimensionality reduction (DR) algorithms compress high-dimensional data ...
research
07/14/2021

Optimality of the Johnson-Lindenstrauss Dimensionality Reduction for Practical Measures

It is well known that the Johnson-Lindenstrauss dimensionality reduction...
research
05/01/2022

Uniform Manifold Approximation with Two-phase Optimization

We introduce Uniform Manifold Approximation with Two-phase Optimization ...

Please sign up or login with your details

Forgot password? Click here to reset