Applicability and Interpretability of Hierarchical Agglomerative Clustering With or Without Contiguity Constraints

Hierarchical Agglomerative Classification (HAC) with Ward's linkage has been widely used since its introduction in Ward (1963). The present article reviews the extensions of the method to various types of input data and to the constrained framework, and states the conditions under which each applies. Several versions of the graphical representation of the results as a dendrogram are also presented, and their properties are clarified. While some of these results can be found scattered across a heterogeneous literature, we clarify and complete them within a uniform framework. In particular, this study reveals an important distinction between a consistency property of the dendrogram and the absence of crossover within it. Finally, a simulation study shows that the constrained version of HAC can sometimes provide more relevant results than its unconstrained version, even though it optimizes the objective criterion over a reduced set of solutions at each step. Overall, the article provides comprehensive recommendations for the use of HAC and constrained HAC depending on the input data, as well as for the representation of the results.
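
To make the notion of a reduced set of solutions concrete, here is a minimal sketch, not the article's implementation. It assumes one-dimensional data and a chain-shaped contiguity constraint (only clusters adjacent in the input order may merge). The names `ward_cost` and `hac_ward` are hypothetical and chosen for illustration only.

```python
from itertools import combinations


def ward_cost(a, b):
    """Increase in within-cluster sum of squares when merging a and b.

    Each cluster is summarized as (size, mean) for 1-D data.
    """
    (na, ma), (nb, mb) = a, b
    return na * nb / (na + nb) * (ma - mb) ** 2


def hac_ward(values, contiguous=False):
    """Return the merge history as a list of (members_a, members_b, cost).

    With contiguous=True, only clusters adjacent in the input order are
    candidates (an assumed chain-shaped contiguity constraint).
    """
    # Clusters are kept in input order as (member indices, size, mean).
    clusters = [([i], 1, float(v)) for i, v in enumerate(values)]
    history = []
    while len(clusters) > 1:
        k = len(clusters)
        if contiguous:
            # Reduced candidate set: k - 1 adjacent pairs.
            candidates = [(i, i + 1) for i in range(k - 1)]
        else:
            # Full candidate set: all k * (k - 1) / 2 pairs.
            candidates = list(combinations(range(k), 2))
        i, j = min(
            candidates,
            key=lambda p: ward_cost(clusters[p[0]][1:], clusters[p[1]][1:]),
        )
        (idx_a, na, ma), (idx_b, nb, mb) = clusters[i], clusters[j]
        history.append((idx_a, idx_b, ward_cost((na, ma), (nb, mb))))
        # Replace cluster i by the merged cluster and drop cluster j (i < j).
        clusters[i] = (idx_a + idx_b, na + nb, (na * ma + nb * mb) / (na + nb))
        del clusters[j]
    return history


values = [0.0, 5.0, 0.1, 5.2, 0.3]
free = hac_ward(values)                    # unconstrained HAC
chain = hac_ward(values, contiguous=True)  # contiguity-constrained HAC
print(free[0])
print(chain[0])
```

In this toy example the first constrained merge costs at least as much as the first unconstrained merge, because the constrained search considers only adjacent pairs. Whether the resulting hierarchy is more or less relevant depends on the data, which is the question the simulation study addresses.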


