Differential Description Length for Hyperparameter Selection in Machine Learning

02/13/2019
by   Anders Host-Madsen, et al.
0

This paper introduces a new method for model selection and more generally hyperparameter selection in machine learning. The paper first proves a relationship between generalization error and a difference of description lengths of the training data; we call this difference differential description length (DDL). This allows prediction of generalization error from the training data alone by performing encoding of the training data. This can now be used for model selection by choosing the model that has the smallest predicted generalization error. We show how this encoding can be done for linear regression and neural networks. We provide experiments showing that this leads to smaller generalization error than cross-validation and traditional MDL and Bayes methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/21/2009

Sparsification and feature selection by compressive linear regression

The Minimum Description Length (MDL) principle states that the optimal m...
research
09/26/2020

Small Data, Big Decisions: Model Selection in the Small-Data Regime

Highly overparametrized neural networks can display curiously strong gen...
research
01/12/2023

Toward Theoretical Guidance for Two Common Questions in Practical Cross-Validation based Hyperparameter Selection

We show, to our knowledge, the first theoretical treatments of two commo...
research
02/22/2022

Connecting Optimization and Generalization via Gradient Flow Path Length

Optimization and generalization are two essential aspects of machine lea...
research
08/10/2017

Automatic Selection of t-SNE Perplexity

t-Distributed Stochastic Neighbor Embedding (t-SNE) is one of the most w...
research
06/23/2021

Training Data Subset Selection for Regression with Controlled Generalization Error

Data subset selection from a large number of training instances has been...
research
07/12/2019

Sparsely Activated Networks

Previous literature on unsupervised learning focused on designing struct...

Please sign up or login with your details

Forgot password? Click here to reset