Margin Maximization as Lossless Maximal Compression

01/28/2020
by Nikolaos Nikolaou, et al.

The ultimate goal of a supervised learning algorithm is to produce models, constructed on the training data, that generalize well to new examples. In classification, functional margin maximization (correctly classifying as many training examples as possible with maximal confidence) is known to produce models with good generalization guarantees. This work gives an information-theoretic interpretation of a margin-maximizing model on a noiseless training dataset as one that achieves lossless maximal compression of that dataset, i.e. one that extracts from the features all the information useful for predicting the label and no more. The connection offers new insights into generalization in supervised machine learning: it shows margin maximization to be a special case (that of classification) of a more general principle, and it explains the success and potential limitations of popular learning algorithms such as gradient boosting. We support our observations with theoretical arguments and empirical evidence and identify interesting directions for future work.
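
To make the notion of a functional margin concrete, here is a minimal sketch, assuming binary labels in {-1, +1} and a real-valued classifier score f(x). The function name `functional_margins` and the toy scores are illustrative assumptions and are not taken from the paper. Under these assumptions, the margin of an example is y * f(x): its sign says whether the example is correctly classified, and its magnitude measures the confidence.

```python
import numpy as np


def functional_margins(scores, labels):
    """Return the functional margin y * f(x) of each example.

    Assumes labels are in {-1, +1} and scores are real-valued
    classifier outputs f(x). Both names are hypothetical helpers
    used only for this illustration.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=float)
    return labels * scores


# Toy data (made up for illustration): four examples.
scores = np.array([2.0, -0.5, 0.3, -1.5])
labels = np.array([1, -1, -1, 1])

margins = functional_margins(scores, labels)

# A positive margin means the example is correctly classified.
num_correct = int(np.sum(margins > 0))

# The minimum margin is the quantity a margin-maximizing learner
# tries to push up across the training set.
min_margin = float(margins.min())
```

In this toy case two of the four examples have positive margins; a margin-maximizing learner would adjust f so that every margin becomes positive and as large as possible.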


