Pooled scale estimators for scaling prior to cluster analysis

12/22/2019
by   Jakob Raymaekers, et al.
0

We propose a new approach for scaling prior to cluster analysis based on the concept of pooled variance. Unlike available scaling procedures such as the standard deviation and the range, our proposed scale avoids dampening the beneficial effect of informative clustering variables. We confirm through an extensive simulation study and applications to well known real data examples that the proposed scaling method is safe and generally useful. Finally, we use our approach to cluster a high dimensional genomic dataset consisting of gene expression data for several specimens of breast cancer cells tissue.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/22/2019

Pooled variable scaling for cluster analysis

We propose a new approach for scaling prior to cluster analysis based on...
research
05/17/2022

Shape complexity in cluster analysis

In cluster analysis, a common first step is to scale the data aiming to ...
research
02/08/2022

Adaptive Bayesian Variable Clustering via Structural Learning of Breast Cancer Data

Clustering of proteins is of interest in cancer cell biology. This artic...
research
01/23/2022

Non-decimated 2D Wavelet Spectrum and Its Use in Breast Cancer Diagnostics

To improve diagnostic accuracy of breast cancer detection, several resea...
research
08/18/2017

Data-Driven Tree Transforms and Metrics

We consider the analysis of high dimensional data given in the form of a...
research
10/28/2019

Estimation and inference for the indirect effect in high-dimensional linear mediation models

Mediation analysis is difficult when the number of potential mediators i...
research
06/14/2021

On the Adimensional Scale Invariant Steffensen (ASIS) Method

Dimensionality of parameters and variables is a fundamental issue in phy...

Please sign up or login with your details

Forgot password? Click here to reset