A Sub-linear Time Framework for Geometric Optimization with Outliers in High Dimensions

04/20/2020
by   Hu Ding, et al.
0

Many real-world problems can be formulated as geometric optimization problems in high dimensions, especially in the fields of machine learning and data mining. Moreover, we often need to take into account of outliers when optimizing the objective functions. However, the presence of outliers could make the problems to be much more challenging than their vanilla versions. In this paper, we study the fundamental minimum enclosing ball (MEB) with outliers problem first; partly inspired by the core-set method from Bădoiu and Clarkson, we propose a sub-linear time bi-criteria approximation algorithm based on two novel techniques, the Uniform-Adaptive Sampling method and Sandwich Lemma. To the best of our knowledge, our result is the first sub-linear time algorithm, which has the sample size (i.e., the number of sampled points) independent of both the number of input points n and dimensionality d, for MEB with outliers in high dimensions. Furthermore, we observe that these two techniques can be generalized to deal with a broader range of geometric optimization problems with outliers in high dimensions, including flat fitting, k-center clustering, and SVM with outliers, and therefore achieve the sub-linear time algorithms for these problems respectively.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/07/2023

Sublinear Time Algorithms for Several Geometric Optimization (With Outliers) Problems In Machine Learning

In this paper, we study several important geometric optimization problem...
research
04/08/2019

Minimum Enclosing Ball Revisited: Stability, Sub-linear Time Algorithms, and Extension

In this paper, we revisit the Minimum Enclosing Ball (MEB) problem and i...
research
02/27/2020

Layered Sampling for Robust Optimization Problems

In real world, our datasets often contain outliers. Moreover, the outlie...
research
02/28/2021

Is Simple Uniform Sampling Efficient for Center-Based Clustering With Outliers: When and Why?

Clustering has many important applications in computer science, but real...
research
04/08/2019

Minimum Enclosing Ball Revisited: Stability and Sub-linear Time Algorithms

In this paper, we revisit the Minimum Enclosing Ball (MEB) problem and i...
research
02/28/2019

Probabilistic smallest enclosing ball in high dimensions via subgradient sampling

We study a variant of the median problem for a collection of point sets ...
research
02/23/2021

Robust k-Center with Two Types of Radii

In the non-uniform k-center problem, the objective is to cover points in...

Please sign up or login with your details

Forgot password? Click here to reset