Comparing Two Partitions of Non-Equal Sets of Units

05/21/2018
by   Marjan Cugmas, et al.
0

Rand (1971) proposed what has since become a well-known index for comparing two partitions obtained on the same set of units. The index takes a value on the interval between 0 and 1, where a higher value indicates more similar partitions. Sometimes, e.g. when the units are observed in two time periods, the splitting and merging of clusters should be considered differently, according to the operationalization of the stability of clusters. The Rand Index is symmetric in the sense that both the splitting and merging of clusters lower the value of the index. In such a non-symmetric case, one of the Wallace indexes (Wallace, 1983) can be used. Further, there are several cases when one wants to compare two partitions obtained on different sets of units, where the intersection of these sets of units is a non-empty set of units. In this instance, the new units and units which leave the clusters from the first partition can be considered as a factor lowering the value of the index. Therefore, a modified Rand index is presented. Because the splitting and merging of clusters have to be considered differently in some situations, an asymmetric modified Wallace Index is also proposed. For all presented indices, the correction for chance is described, which allows different values of a selected index to be compared.

READ FULL TEXT

page 15

page 18

research
07/30/2019

Comparing partitions through the Matching Error

With the aim to propose a non parametric hypothesis test, this paper car...
research
11/02/2017

Scientific co-authorship networks

The paper addresses the stability of the co-authorship networks in time....
research
01/07/2019

Understanding partition comparison indices based on counting object pairs

In unsupervised machine learning, agreement between partitions is common...
research
06/02/2019

Comprehensive cluster validity Index based on structural simplicity

Nonhierarchical clustering depending on unsupervised algorithms may not ...
research
06/25/2022

Inverted Semantic-Index for Image Retrieval

This paper addresses the construction of inverted index for large-scale ...
research
06/17/2016

Ground Truth Bias in External Cluster Validity Indices

It has been noticed that some external CVIs exhibit a preferential bias ...
research
03/10/2023

A low-order automatic domain splitting approach for nonlinear uncertainty mapping

This paper introduces a novel method for the automatic detection and han...

Please sign up or login with your details

Forgot password? Click here to reset