Support vector machines and linear regression coincide with very high-dimensional features

05/28/2021
by   Navid Ardeshir, et al.
0

The support vector machine (SVM) and minimum Euclidean norm least squares regression are two fundamentally different approaches to fitting linear models, but they have recently been connected in models for very high-dimensional data through a phenomenon of support vector proliferation, where every training example used to fit an SVM becomes a support vector. In this paper, we explore the generality of this phenomenon and make the following contributions. First, we prove a super-linear lower bound on the dimension (in terms of sample size) required for support vector proliferation in independent feature models, matching the upper bounds from previous works. We further identify a sharp phase transition in Gaussian feature models, bound the width of this transition, and give experimental support for its universality. Finally, we hypothesize that this phase transition occurs only in much higher-dimensional settings in the ℓ_1 variant of the SVM, and we present a new geometric characterization of the problem that may elucidate this phenomenon for the general ℓ_p case.

READ FULL TEXT

page 7

page 9

page 26

research
09/22/2020

On the proliferation of support vectors in high dimensions

The support vector machine (SVM) is a well-established classification me...
research
05/03/2023

New Equivalences Between Interpolation and SVMs: Kernels and Structured Features

The support vector machine (SVM) is a supervised learning algorithm that...
research
12/21/2020

Predicting the Critical Number of Layers for Hierarchical Support Vector Regression

Hierarchical support vector regression (HSVR) models a function from dat...
research
07/21/2021

Predicting trajectory behaviour via machine-learned invariant manifolds

In this paper we use support vector machines (SVM) to develop a machine ...
research
04/18/2015

On the consistency of Multithreshold Entropy Linear Classifier

Multithreshold Entropy Linear Classifier (MELC) is a recent classifier i...
research
11/18/2020

Benign Overfitting in Binary Classification of Gaussian Mixtures

Deep neural networks generalize well despite being exceedingly overparam...
research
05/02/2020

SVM-Lattice: A Recognition Evaluation Frame for Double-peaked Profiles

In big data era, the special data with rare characteristics may be of gr...

Please sign up or login with your details

Forgot password? Click here to reset