Quantizing Multiple Sources to a Common Cluster Center: An Asymptotic Analysis

10/23/2020
by   Erdem Koyuncu, et al.
0

We consider quantizing an Ld-dimensional sample, which is obtained by concatenating L vectors from datasets of d-dimensional vectors, to a d-dimensional cluster center. The distortion measure is the weighted sum of rth powers of the distances between the cluster center and the samples. For L=1, one recovers the ordinary center based clustering formulation. The general case L>1 appears when one wishes to cluster a dataset through L noisy observations of each of its members. We find a formula for the average distortion performance in the asymptotic regime where the number of cluster centers are large. We also provide an algorithm to numerically optimize the cluster centers and verify our analytical results on real and artificial datasets. In terms of faithfulness to the original (noiseless) dataset, our clustering approach outperforms the naive approach that relies on quantizing the Ld-dimensional noisy observation vectors to Ld-dimensional centers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2016

Clustering by connection center evolution

The determination of cluster centers generally depends on the scale that...
research
07/04/2020

Cluster Prediction for Opinion Dynamics from Partial Observations

We present a Bayesian approach to predict the clustering of opinions for...
research
12/30/2022

A Global Optimization Algorithm for K-Center Clustering of One Billion Samples

This paper presents a practical global optimization algorithm for the K-...
research
10/12/2022

Matern Cluster Process with Holes at the Cluster Centers

Inspired by recent applications of point processes to biological nanonet...
research
05/03/2023

CLUSTSEG: Clustering for Universal Segmentation

We present CLUSTSEG, a general, transformer-based framework that tackles...
research
06/22/2021

Diversity-aware k-median : Clustering with fair center representation

We introduce a novel problem for diversity-aware clustering. We assume t...
research
03/05/2019

A Deep Learning based approach to VM behavior identification in cloud systems

Cloud computing data centers are growing in size and complexity to the p...

Please sign up or login with your details

Forgot password? Click here to reset