Approximating (k,ℓ)-Median Clustering for Polygonal Curves

09/03/2020
by   Maike Buchin, et al.
0

In 2015, Driemel, Krivošija and Sohler introduced the (k,ℓ)-median problem for clustering polygonal curves under the Fréchet distance. Given a set of input curves, the problem asks to find k median curves of at most ℓ vertices each that minimize the sum of Fréchet distances over all input curves to their closest median curve. A major shortcoming of their algorithm is that the input curves are restricted to lie on the real line. In this paper, we present a randomized bicriteria-approximation algorithm that works for polygonal curves in ℝ^d and achieves approximation factor (1+ϵ) with respect to the clustering costs. The algorithm has worst-case running-time linear in the number of curves, polynomial in the maximum number of vertices per curve, i.e. their complexity, and exponential in d, ℓ, ϵ and δ, i.e., the failure probability. We achieve this result through a shortcutting lemma, which guarantees the existence of a polygonal curve with similar cost as an optimal median curve of complexity ℓ, but of complexity at most 2ℓ-2, and whose vertices can be computed efficiently. We combine this lemma with the superset-sampling technique by Kumar et al. to derive our clustering result. In doing so, we describe and analyze a generalization of the algorithm by Ackermann et al., which may be of independent interest.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/19/2021

Coresets for (k, ℓ)-Median Clustering under the Fréchet Distance

We present an algorithm for computing ϵ-coresets for (k, ℓ)-median clust...
research
07/16/2022

Curve Simplification and Clustering under Fréchet Distance

We present new approximation results on curve simplification and cluster...
research
01/28/2020

On the complexity of the middle curve problem

For a set of curves, Ahn et al. introduced the notion of a middle curve ...
research
07/16/2019

Random projections and sampling algorithms for clustering of high-dimensional polygonal curves

We study the center and median clustering problems for high-dimensional ...
research
01/07/2019

Coresets for (k,l)-Clustering under the Fréchet Distance

Clustering is the task of partitioning a given set of geometric objects....
research
04/25/2021

Coresets for k-median clustering under Fréchet and Hausdorff distances

We give algorithms for computing coresets for (1+ε)-approximate k-median...
research
12/07/2012

Similarity of Polygonal Curves in the Presence of Outliers

The Fréchet distance is a well studied and commonly used measure to capt...

Please sign up or login with your details

Forgot password? Click here to reset