Hierarchical Clustering for Finding Symmetries and Other Patterns in Massive, High Dimensional Datasets

05/14/2010
by   Fionn Murtagh, et al.
0

Data analysis and data mining are concerned with unsupervised pattern finding and structure determination in data sets. "Structure" can be understood as symmetry and a range of symmetries are expressed by hierarchy. Such symmetries directly point to invariants, that pinpoint intrinsic properties of the data and of the background empirical domain of interest. We review many aspects of hierarchy here, including ultrametric topology, generalized ultrametric, linkages with lattices and other discrete algebraic structures and with p-adic number representations. By focusing on symmetries in data we have a powerful means of structuring and analyzing massive, high dimensional data stores. We illustrate the powerfulness of hierarchical clustering in case studies in chemistry and finance, and we provide pointers to other published case studies.

READ FULL TEXT
research
05/18/2008

Symmetry in Data Mining and Analysis: A Unifying View based on Hierarchy

Data analysis and data mining are concerned with unsupervised pattern fi...
research
11/27/2011

Ward's Hierarchical Clustering Method: Clustering Criterion and Agglomerative Algorithm

The Ward error sum of squares hierarchical clustering method has been ve...
research
09/23/2020

Burning sage: Reversing the curse of dimensionality in the visualization of high-dimensional data

In high-dimensional data analysis the curse of dimensionality reasons th...
research
10/19/2019

Context-Driven Data Mining through Bias Removal and Data Incompleteness Mitigation

The results of data mining endeavors are majorly driven by data quality....
research
02/28/2018

Automatic topography of high-dimensional data sets by non-parametric Density Peak clustering

Data analysis in high-dimensional spaces aims at obtaining a synthetic d...
research
09/02/2021

Knot invariants and their relations: a topological perspective

This work brings methods from topological data analysis to knot theory a...

Please sign up or login with your details

Forgot password? Click here to reset