GPU-Accelerated Optimizer-Aware Evaluation of Submodular Exemplar Clustering

01/21/2021
by   Philipp-Jan Honysz, et al.
11

The optimization of submodular functions constitutes a viable way to perform clustering. Strong approximation guarantees and feasible optimization w.r.t. streaming data make this clustering approach favorable. Technically, submodular functions map subsets of data to real values, which indicate how "representative" a specific subset is. Optimal sets might then be used to partition the data space and to infer clusters. Exemplar-based clustering is one of the possible submodular functions, but suffers from high computational complexity. However, for practical applications, the particular real-time or wall-clock run-time is decisive. In this work, we present a novel way to evaluate this particular function on GPUs, which keeps the necessities of optimizers in mind and reduces wall-clock run-time. To discuss our GPU algorithm, we investigated both the impact of different run-time critical problem properties, like data dimensionality and the number of data points in a subset, and the influence of required floating-point precision. In reproducible experiments, our GPU algorithm was able to achieve competitive speedups of up to 72x depending on whether multi-threaded computation on CPUs was used for comparison and the type of floating-point precision required. Half-precision GPU computation led to large speedups of up to 452x compared to single-precision, single-thread CPU computations.

READ FULL TEXT
research
05/25/2021

Providing Meaningful Data Summarizations Using Exemplar-based Clustering in Industry 4.0

Data summarizations are a valuable tool to derive knowledge from large d...
research
03/03/2022

Linear Run Time of Persistent Homology Computation with GPU Parallelization

Persistent homology is a crucial invariant that is used in many areas to...
research
09/02/2021

Clock Skew Compensation Algorithm Immune to Floating-Point Precision Loss

We propose a novel clock skew compensation algorithm based on Bresenham'...
research
06/16/2020

Multi-Precision Policy Enforced Training (MuPPET): A precision-switching strategy for quantised fixed-point training of CNNs

Large-scale convolutional neural networks (CNNs) suffer from very long t...
research
10/05/2018

Scaling Submodular Optimization Approaches for Control Applications in Networked Systems

Often times, in many design problems, there is a need to select a small ...
research
06/19/2021

CenterAtt: Fast 2-stage Center Attention Network

In this technical report, we introduce the methods of HIKVISION_LiDAR_De...

Please sign up or login with your details

Forgot password? Click here to reset