Bi-criteria Approximation Algorithms for Minimum Enclosing Ball and k-Center Clustering with Outliers

04/25/2018
by   Hu Ding, et al.
0

Motivated by the arising realistic issues in big data, the problem of Minimum Enclosing Ball (MEB) with outliers has attracted a great deal of attention in recent years. Though several methods have been proposed in both theory and practice, most of them still suffer from the issues such as high time complexities or unstable performances for different datasets. For example, Kumar et al. DBLP:journals/jea/KumarMY03 proposed the open problem "Are there practical methods for computing an MEB with outliers?" To answer this question, we present a randomized algorithm for MEB with outliers in high dimension. In particular, we provide a more "robust" analysis for the core-set construction of MEB, and propose a couple of novel improvements to further reduce the time complexity. The ideas behind are interesting in their own right and we expect to apply them to solve more high-dimensional problems. We also extend our method to the problem of k-center clustering with outliers. To show the efficiency and practicality, we test our algorithms on random datasets and also apply it to solve outlier recognition on benchmark image datasets. To our best knowledge, this is the first result yielding both solid theoretical quality guarantee and promising practical performance for the problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/25/2018

Solving Minimum Enclosing Ball with Outliers: Algorithm, Implementation, and Application

Motivated by the arising realistic issues in big data, the problem of Mi...
research
04/08/2019

Minimum Enclosing Ball Revisited: Stability, Sub-linear Time Algorithms, and Extension

In this paper, we revisit the Minimum Enclosing Ball (MEB) problem and i...
research
01/24/2019

Greedy Strategy Works for Clustering with Outliers and Coresets Construction

We study the problems of clustering with outliers in high dimension. Tho...
research
02/27/2020

The Effectiveness of Johnson-Lindenstrauss Transform for High Dimensional Optimization with Outliers

Johnson-Lindenstrauss (JL) Transform is one of the most popular methods ...
research
10/18/2018

Distributed k-Clustering for Data with Heavy Noise

In this paper, we consider the k-center/median/means clustering with out...
research
01/07/2023

Randomized Greedy Algorithms and Composable Coreset for k-Center Clustering with Outliers

In this paper, we study the problem of k-center clustering with outliers...
research
10/19/2022

Near-optimal Coresets for Robust Clustering

We consider robust clustering problems in ℝ^d, specifically k-clustering...

Please sign up or login with your details

Forgot password? Click here to reset