Layered Sampling for Robust Optimization Problems

02/27/2020
by Hu Ding, et al.

Real-world datasets often contain outliers, and these outliers can seriously degrade the final machine learning result. Most existing algorithms for handling outliers have high time complexity (e.g., quadratic or cubic). Coresets are a popular approach for compressing data in order to speed up optimization algorithms. However, current coreset methods cannot be easily extended to handle outliers. In this paper, we propose a new variant of the coreset technique, layered sampling, to deal with two fundamental robust optimization problems: k-median/means clustering with outliers and linear regression with outliers. This new coreset method is particularly suitable for speeding up iterative algorithms (which often improve the solution within a local range) for these robust optimization problems. Moreover, our method is easy to implement in practice. We expect that our framework of layered sampling will be applicable to other robust optimization problems.
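To make the k-means clustering with outliers objective concrete, here is a minimal sketch of its standard cost function: the sum of squared distances from each point to its nearest center, with the z farthest points discarded as outliers. It does not implement layered sampling itself, whose details are not given in this abstract. The function names, the parameter z, and the toy data are assumptions chosen for illustration.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two points given as sequences."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans_cost_with_outliers(points, centers, z):
    """k-means cost after discarding the z points farthest from their nearest center.

    Hypothetical helper for illustration; z is the assumed number of outliers.
    """
    # Squared distance from every point to its closest center, in ascending order.
    nearest = sorted(min(sq_dist(p, c) for c in centers) for p in points)
    # Keep all but the z largest distances (the presumed outliers).
    kept = nearest[: max(len(nearest) - z, 0)]
    return sum(kept)


# Toy data: two tight clusters plus one far-away point.
points = [(0.0, 0.0), (1.0, 0.0), (10.0, 10.0), (11.0, 10.0), (100.0, 100.0)]
centers = [(0.5, 0.0), (10.5, 10.0)]

cost_plain = kmeans_cost_with_outliers(points, centers, z=0)
cost_robust = kmeans_cost_with_outliers(points, centers, z=1)
```

On this toy data the single distant point accounts for almost all of the untrimmed cost, which illustrates why outliers can seriously affect the result when they are not excluded.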


research
06/30/2021

Robust Coreset for Continuous-and-Bounded Learning (with Outliers)

In this big data era, we often confront large-scale data in many machine...
research
04/20/2020

A Sub-linear Time Framework for Geometric Optimization with Outliers in High Dimensions

Many real-world problems can be formulated as geometric optimization pro...
research
02/27/2020

The Effectiveness of Johnson-Lindenstrauss Transform for High Dimensional Optimization with Outliers

Johnson-Lindenstrauss (JL) Transform is one of the most popular methods ...
research
01/07/2023

Sublinear Time Algorithms for Several Geometric Optimization (With Outliers) Problems In Machine Learning

In this paper, we study several important geometric optimization problem...
research
05/24/2019

A Practical Framework for Solving Center-Based Clustering with Outliers

Clustering has many important applications in computer science, but real...
research
10/09/2022

Coresets for Wasserstein Distributionally Robust Optimization Problems

Wasserstein distributionally robust optimization is a popular model t...
research
11/29/2019

A robust method based on LOVO functions for solving least squares problems

The robust adjustment of nonlinear models to data is considered in this ...
