k-Center Clustering with Outliers in Sliding Windows

01/07/2022
by   Paolo Pellizzoni, et al.
0

Metric k-center clustering is a fundamental unsupervised learning primitive. Although widely used, this primitive is heavily affected by noise in the data, so that a more sensible variant seeks for the best solution that disregards a given number z of points of the dataset, called outliers. We provide efficient algorithms for this important variant in the streaming model under the sliding window setting, where, at each time step, the dataset to be clustered is the window W of the most recent data items. Our algorithms achieve O(1) approximation and, remarkably, require a working memory linear in k+z and only logarithmic in |W|. As a by-product, we show how to estimate the effective diameter of the window W, which is a measure of the spread of the window points, disregarding a given fraction of noisy distances. We also provide experimental evidence of the practical viability of our theoretical results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/24/2021

k-Center Clustering with Outliers in the Sliding-Window Model

The k-center problem for a point set P asks for a collection of k congru...
research
02/26/2018

Improved MapReduce and Streaming Algorithms for k-Center Clustering (with Outliers)

We present efficient MapReduce and Streaming algorithms for the k-center...
research
06/10/2020

Sliding Window Algorithms for k-Clustering Problems

The sliding window model of computation captures scenarios in which data...
research
02/16/2022

Distributed k-Means with Outliers in General Metrics

Center-based clustering is a pivotal primitive for unsupervised learning...
research
02/24/2023

k-Center Clustering with Outliers in the MPC and Streaming Model

Given a point set P ⊆ X of size n in a metric space (X,dist) of doubling...
research
05/09/2019

Coresets for Minimum Enclosing Balls over Sliding Windows

Coresets are important tools to generate concise summaries of massive da...
research
03/08/2021

Sliding Window Persistence of Quasiperiodic Functions

A function is called quasiperiodic if its fundamental frequencies are li...

Please sign up or login with your details

Forgot password? Click here to reset