Peer groups for organisational learning: clustering with practical constraints

11/17/2020
by   Daniel William Kennedy, et al.
0

Peer-grouping is used in many sectors for organisational learning, policy implementation, and benchmarking. Clustering provides a statistical, data-driven method for constructing meaningful peer groups, but peer groups must be compatible with business constraints such as size and stability considerations. Additionally, statistical peer groups are constructed from many different variables, and can be difficult to understand, especially for non-statistical audiences. We developed methodology to apply business constraints to clustering solutions and allow the decision-maker to choose the balance between statistical goodness-of-fit and conformity to business constraints. Several tools were utilised to identify complex distinguishing features in peer groups, and a number of visualisations are developed to explain high-dimensional clusters for non-statistical audiences. In a case study where peer group size was required to be small (≤ 100 members), we applied constrained clustering to a noisy high-dimensional data-set over two subsequent years, ensuring that the clusters were sufficiently stable between years. Our approach not only satisfied clustering constraints on the test data, but maintained an almost monotonic negative relationship between goodness-of-fit and stability between subsequent years. We demonstrated in the context of the case study how distinguishing features between clusters can be communicated clearly to different stakeholders with substantial and limited statistical knowledge.

READ FULL TEXT
research
06/16/2021

Clustering inference in multiple groups

Inference in clustering is paramount to uncovering inherent group struct...
research
10/24/2022

Post-clustering difference testing: valid inference and practical considerations

Clustering is part of unsupervised analysis methods that consist in grou...
research
11/17/2020

A statistical machine learning approach for benchmarking in the presence of complex contextual factors and peer groups

The ability to compare between individuals or organisations fairly is im...
research
11/12/2017

K-groups: A Generalization of K-means Clustering

We propose a new class of distribution-based clustering algorithms, call...
research
09/20/2022

Peer-group Behaviour Analytics of Windows Authentications Events Using Hierarchical Bayesian Modelling

Cyber-security analysts face an increasingly large number of alerts rece...
research
05/27/2023

Dynamic User Segmentation and Usage Profiling

Usage data of a group of users distributed across a number of categories...
research
06/16/2022

Deconstructing written rules and hierarchy in peer produced software communities

We employ recent advances in computational institutional analysis and NL...

Please sign up or login with your details

Forgot password? Click here to reset