Scaled torus principal component analysis

10/10/2021
by   Pavlos Zoubouloglou, et al.
0

A particularly challenging context for dimensionality reduction is multivariate circular data, i.e., data supported on a torus. Such kind of data appears, e.g., in the analysis of various phenomena in ecology and astronomy, as well as in molecular structures. This paper introduces Scaled Torus Principal Component Analysis (ST-PCA), a novel approach to perform dimensionality reduction with toroidal data. ST-PCA finds a data-driven map from a torus to a sphere of the same dimension and a certain radius. The map is constructed with multidimensional scaling to minimize the discrepancy between pairwise geodesic distances in both spaces. ST-PCA then resorts to principal nested spheres to obtain a nested sequence of subspheres that best fits the data, which can afterwards be inverted back to the torus. Numerical experiments illustrate how ST-PCA can be used to achieve meaningful dimensionality reduction on low-dimensional torii, particularly with the purpose of clusters separation, while two data applications in astronomy (three-dimensional torus) and molecular biology (on a seven-dimensional torus) show that ST-PCA outperforms existing methods for the investigated datasets.

READ FULL TEXT
research
01/07/2020

A kernel Principal Component Analysis (kPCA) digest with a new backward mapping (pre-image reconstruction) strategy

Methodologies for multidimensionality reduction aim at discovering low-d...
research
04/22/2022

Compressibility: Power of PCA in Clustering Problems Beyond Dimensionality Reduction

In this paper we take a step towards understanding the impact of princip...
research
07/01/2022

Local manifold learning and its link to domain-based physics knowledge

In many reacting flow systems, the thermo-chemical state-space is known ...
research
03/22/2019

Principal nested shape space analysis of molecular dynamics data

Molecular dynamics simulations produce huge datasets of temporal sequenc...
research
10/25/2017

DPCA: Dimensionality Reduction for Discriminative Analytics of Multiple Large-Scale Datasets

Principal component analysis (PCA) has well-documented merits for data e...
research
09/19/2023

O(k)-Equivariant Dimensionality Reduction on Stiefel Manifolds

Many real-world datasets live on high-dimensional Stiefel and Grassmanni...
research
05/23/2022

PCA-Boosted Autoencoders for Nonlinear Dimensionality Reduction in Low Data Regimes

Autoencoders (AE) provide a useful method for nonlinear dimensionality r...

Please sign up or login with your details

Forgot password? Click here to reset