Screening Data Points in Empirical Risk Minimization via Ellipsoidal Regions and Safe Loss Function

12/05/2019
by   Grégoire Mialon, et al.
6

We design simple screening tests to automatically discard data samples in empirical risk minimization without losing optimization guarantees. We derive loss functions that produce dual objectives with a sparse solution. We also show how to regularize convex losses to ensure such a dual sparsity-inducing property, and propose a general method to design screening tests for classification or regression based on ellipsoidal approximations of the optimal set. In addition to producing computational gains, our approach also allows us to compress a dataset into a subset of representative points.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/07/2015

Primal Method for ERM with Flexible Mini-batching Schemes and Non-convex Losses

In this work we develop a new algorithm for regularized empirical risk m...
research
12/03/2019

Nonparametric Screening under Conditional Strictly Convex Loss for Ultrahigh Dimensional Sparse Data

Sure screening technique has been considered as a powerful tool to handl...
research
06/11/2020

Safe Screening Rules for Generalized Double Sparsity Learning

In a high-dimensional setting, sparse model has shown its power in compu...
research
10/22/2021

Safe rules for the identification of zeros in the solutions of the SLOPE problem

In this paper we propose a methodology to accelerate the resolution of t...
research
10/26/2017

Joint Screening Tests for LASSO

This paper focusses on "safe" screening techniques for the LASSO problem...
research
05/22/2018

Safe Element Screening for Submodular Function Minimization

Submodular functions are discrete analogs of convex functions, which hav...
research
06/17/2023

Reevaluating the Role of Race and Ethnicity in Diabetes Screening

There is active debate over whether to consider patient race and ethnici...

Please sign up or login with your details

Forgot password? Click here to reset