Clubmark: a Parallel Isolation Framework for Benchmarking and Profiling Clustering Algorithms on NUMA Architectures

02/01/2019
by Artem Lutov, et al.

There is a great diversity of clustering and community detection algorithms, which are key components of many data analysis and exploration systems. To the best of our knowledge, however, there does not yet exist a uniform benchmarking framework that is publicly available and suitable for the parallel benchmarking of diverse clustering algorithms on a wide range of synthetic and real-world datasets. In this paper, we introduce Clubmark, a new extensible framework that aims to fill this gap by providing a parallel isolation benchmarking platform for clustering algorithms and their evaluation on NUMA servers. Clubmark allows for fine-grained control over various execution variables (timeouts, memory consumption, CPU affinity and cache policy). It supports the evaluation of a wide range of clustering algorithms, including multi-level, hierarchical and overlapping clustering techniques, on both weighted and unweighted input networks, with built-in evaluation of several extrinsic and intrinsic measures. Our framework is open-source and provides a consistent and systematic way to execute, evaluate and profile clustering techniques, taking into account a number of aspects that are often missing in state-of-the-art frameworks and benchmarking systems.
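
To make the notion of controlled execution variables concrete, the following is a minimal sketch of launching one benchmarked process with a timeout, an address-space cap and CPU pinning, using only the Python standard library. It is not Clubmark's API: the function name `run_isolated` and its parameters are hypothetical, cache policy and NUMA placement are not modelled, and the memory and affinity limits are assumed to be best-effort and POSIX/Linux-only.

```python
import os
import subprocess

try:
    import resource  # available on Unix-like systems only
except ImportError:
    resource = None


def run_isolated(cmd, timeout_s, mem_bytes=None, cpus=None):
    """Run cmd in a child process under a timeout, with optional limits.

    Hypothetical helper, not part of Clubmark.
    Returns (returncode, stdout, timed_out).
    """
    def limit_child():
        # Executed in the child right before exec (POSIX only).
        if cpus is not None and hasattr(os, "sched_setaffinity"):
            os.sched_setaffinity(0, cpus)  # pin the child to the given CPUs
        if mem_bytes is not None and resource is not None:
            try:
                _, hard = resource.getrlimit(resource.RLIMIT_AS)
                cap = mem_bytes
                if hard != resource.RLIM_INFINITY:
                    cap = min(mem_bytes, hard)
                resource.setrlimit(resource.RLIMIT_AS, (cap, hard))
            except (ValueError, OSError):
                pass  # best-effort: some platforms reject this limit

    try:
        proc = subprocess.run(
            cmd,
            capture_output=True,
            text=True,
            timeout=timeout_s,  # the child is killed when this expires
            preexec_fn=limit_child if os.name == "posix" else None,
        )
        return proc.returncode, proc.stdout, False
    except subprocess.TimeoutExpired:
        return None, "", True
```

A call such as `run_isolated(["./algorithm", "input.net"], timeout_s=3600, mem_bytes=8 * 2**30, cpus={0, 1})` (the command and file name are placeholders) would run one job for at most an hour on two cores. A benchmarking framework would schedule many such jobs in parallel and record their outcomes.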
