Cluster Forests

04/14/2011
by   Donghui Yan, et al.
0

With inspiration from Random Forests (RF) in the context of classification, a new clustering ensemble method---Cluster Forests (CF) is proposed. Geometrically, CF randomly probes a high-dimensional data cloud to obtain "good local clusterings" and then aggregates via spectral clustering to obtain cluster assignments for the whole dataset. The search for good local clusterings is guided by a cluster quality measure kappa. CF progressively improves each local clustering in a fashion that resembles the tree growth in RF. Empirical studies on several real-world datasets under two different performance metrics show that CF compares favorably to its competitors. Theoretical analysis reveals that the kappa measure makes it possible to grow the local clustering in a desirable way---it is "noise-resistant". A closed-form expression is obtained for the mis-clustering rate of spectral clustering under a perturbation model, which yields new insights into some aspects of spectral clustering.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/28/2019

Similarity Kernel and Clustering via Random Projection Forests

Similarity plays a fundamental role in many areas, including data mining...
research
06/12/2017

Fast Approximate Spectral Clustering for Dynamic Networks

Spectral clustering is a widely studied problem, yet its complexity is p...
research
06/29/2018

Certifying Global Optimality of Graph Cuts via Semidefinite Relaxation: A Performance Guarantee for Spectral Clustering

Spectral clustering has become one of the most widely used clustering te...
research
10/23/2022

Local and Global Structure Preservation Based Spectral Clustering

Spectral Clustering (SC) is widely used for clustering data on a nonline...
research
07/18/2022

Simplifying Clustering with Graph Neural Networks

The objective functions used in spectral clustering are usually composed...
research
04/30/2022

Understanding the Generalization Performance of Spectral Clustering Algorithms

The theoretical analysis of spectral clustering mainly focuses on consis...
research
02/06/2019

An Automated Spectral Clustering for Multi-scale Data

Spectral clustering algorithms typically require a priori selection of i...

Please sign up or login with your details

Forgot password? Click here to reset