Tight bounds for maximum ℓ_1-margin classifiers

12/07/2022
by   Stefan Stojanovic, et al.
0

Popular iterative algorithms such as boosting methods and coordinate descent on linear models converge to the maximum ℓ_1-margin classifier, a.k.a. sparse hard-margin SVM, in high dimensional regimes where the data is linearly separable. Previous works consistently show that many estimators relying on the ℓ_1-norm achieve improved statistical rates for hard sparse ground truths. We show that surprisingly, this adaptivity does not apply to the maximum ℓ_1-margin classifier for a standard discriminative setting. In particular, for the noiseless setting, we prove tight upper and lower bounds for the prediction error that match existing rates of order _1^2/3/n^1/3 for general ground truths. To complete the picture, we show that when interpolating noisy observations, the error vanishes at a rate of order 1/√(log(d/n)). We are therefore first to show benign overfitting for the maximum ℓ_1-margin classifier.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/27/2019

Margin-Based Generalization Lower Bounds for Boosted Classifiers

Boosting is one of the most successful ideas in machine learning. The mo...
research
10/28/2021

Tractability from overparametrization: The example of the negative perceptron

In the negative perceptron problem we are given n data points ( x_i,y_i)...
research
11/10/2021

Tight bounds for minimum l1-norm interpolation of noisy data

We provide matching upper and lower bounds of order σ^2/log(d/n) for the...
research
10/29/2020

The Performance Analysis of Generalized Margin Maximizer (GMM) on Separable Data

Logistic models are commonly used for binary classification tasks. The s...
research
02/05/2020

A Precise High-Dimensional Asymptotic Theory for Boosting and Min-L1-Norm Interpolated Classifiers

This paper establishes a precise high-dimensional asymptotic theory for ...
research
04/18/2015

On the consistency of Multithreshold Entropy Linear Classifier

Multithreshold Entropy Linear Classifier (MELC) is a recent classifier i...
research
03/07/2018

Sequential Maximum Margin Classifiers for Partially Labeled Data

In many real-world applications, data is not collected as one batch, but...

Please sign up or login with your details

Forgot password? Click here to reset