Outliers Detection Is Not So Hard: Approximation Algorithms for Robust Clustering Problems Using Local Search Techniques

12/20/2020
by   Yishui Wang, et al.
0

In this paper, we consider two types of robust models of the k-median/k-means problems: the outlier-version (k-MedO/k-MeaO) and the penalty-version (k-MedP/k-MeaP), in which we can mark some points as outliers and discard them. In k-MedO/k-MeaO, the number of outliers is bounded by a given integer. In k-MedP/k-MeaP, we do not bound the number of outliers, but each outlier will incur a penalty cost. We develop a new technique to analyze the approximation ratio of local search algorithms for these two problems by introducing an adapted cluster that can capture useful information about outliers in the local and the global optimal solution. For k-MeaP, we improve the best known approximation ratio based on local search from 25+ε to 9+ε. For k-MedP, we obtain the best known approximation ratio. For k-MedO/k-MeaO, there exists only two bi-criteria approximation algorithms based on local search. One violates the outlier constraint (the constraint on the number of outliers), while the other violates the cardinality constraint (the constraint on the number of clusters). We consider the former algorithm and improve its approximation ratios from 17+ε to 3+ε for k-MedO, and from 274+ε to 9+ε for k-MeaO.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/13/2020

Consistent k-Median: Simpler, Better and Robust

In this paper we introduce and study the online consistent k-clustering ...
research
12/01/2022

Clustering What Matters: Optimal Approximation for Clustering with Outliers

Clustering with outliers is one of the most fundamental problems in Comp...
research
01/23/2020

Best Principal Submatrix Selection for the Maximum Entropy Sampling Problem: Scalable Algorithms and Performance Guarantees

This paper studies a classic maximum entropy sampling problem (MESP), wh...
research
12/13/2020

Uniform Capacitated Facility Location Problems with Penalties/Outliers

In this paper, we present a framework to design approximation algorithms...
research
09/02/2020

Structural Iterative Rounding for Generalized k-Median Problems

This paper considers approximation algorithms for generalized k-median p...
research
07/11/2022

Breaching the 2 LMP Approximation Barrier for Facility Location with Applications to k-Median

The Uncapacitated Facility Location (UFL) problem is one of the most fun...
research
10/19/2017

Protein Folding Optimization using Differential Evolution Extended with Local Search and Component Reinitialization

This paper presents a novel differential evolution algorithm for protein...

Please sign up or login with your details

Forgot password? Click here to reset