Universal Supervised Learning for Individual Data

12/22/2018
by Yaniv Fogel, et al.

Universal supervised learning is considered from an information-theoretic point of view, following the universal prediction approach of Merhav and Feder (1998). We consider the standard supervised "batch" learning setting, where prediction is done on a test sample once the entire training data is observed, and the individual setting, where the features and labels, in both the training and test data, are specific individual quantities. The information-theoretic approach naturally uses the self-information loss, or log-loss. Our results provide universal learning schemes that compete with a "genie" (or reference) that knows the true test label. In particular, it is demonstrated that the main proposed scheme, termed Predictive Normalized Maximum Likelihood (pNML), is a robust learning solution that outperforms the current leading approach based on Empirical Risk Minimization (ERM). Furthermore, the pNML construction provides a pointwise indication of the learnability of the specific test challenge with the given training examples.
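
The abstract does not spell out the pNML construction, so the following is only a minimal sketch under stated assumptions. It uses the commonly stated form of pNML: for each candidate test label, refit the maximum-likelihood hypothesis on the training data plus the test point carrying that label, read off the probability that hypothesis assigns to the label, and normalize over labels. The log of the normalizer is taken here as the pointwise learnability indication. The toy hypothesis class (a grid of per-feature Bernoulli parameters), the data, and all function names are hypothetical and chosen purely for illustration.

```python
import math
from itertools import product


def log_likelihood(theta, data):
    """Log-likelihood of (x, y) pairs, where theta[x] = P(y = 1 | x)."""
    total = 0.0
    for x, y in data:
        p1 = theta[x]
        total += math.log(p1 if y == 1 else 1.0 - p1)
    return total


def pnml(train, x_test, hypotheses, labels=(0, 1)):
    """Return (pNML distribution over labels, log-normalizer in nats)."""
    scores = {}
    for y in labels:
        # "Genie" step: pretend the test label is y and refit by maximum likelihood.
        augmented = train + [(x_test, y)]
        best = max(hypotheses, key=lambda th: log_likelihood(th, augmented))
        # Probability the refitted hypothesis assigns to the assumed label.
        p1 = best[x_test]
        scores[y] = p1 if y == 1 else 1.0 - p1
    normalizer = sum(scores.values())
    dist = {y: s / normalizer for y, s in scores.items()}
    return dist, math.log(normalizer)


# Hypothetical finite hypothesis class: one Bernoulli parameter per feature value.
grid = [0.1, 0.3, 0.5, 0.7, 0.9]
hypotheses = [{0: a, 1: b} for a, b in product(grid, grid)]

# Toy training set: feature 0 is seen four times, feature 1 only once.
train = [(0, 0), (0, 0), (0, 0), (0, 0), (1, 1)]

dist_seen, regret_seen = pnml(train, 0, hypotheses)
dist_rare, regret_rare = pnml(train, 1, hypotheses)
print(dist_seen, regret_seen)
print(dist_rare, regret_rare)
```

In this toy setup the log-normalizer is larger for the feature value that appears only once in training than for the one that appears four times, which matches the intuition that a test point is less learnable when the training data constrains its label weakly.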


research
04/28/2019

Deep pNML: Predictive Normalized Maximum Likelihood for Deep Neural Networks

The Predictive Normalized Maximum Likelihood (pNML) scheme has been rece...
research
12/31/2018

Deep Information Theoretic Registration

This paper establishes an information theoretic framework for deep metri...
research
05/12/2019

A New Look at an Old Problem: A Universal Learning Approach to Linear Regression

Linear regression is a classical paradigm in statistics. A new look at i...
research
09/30/2020

First-order Optimization for Superquantile-based Supervised Learning

Classical supervised learning via empirical risk (or negative log-likeli...
research
11/19/2017

Compression-Based Regularization with an Application to Multi-Task Learning

This paper investigates, from information theoretic grounds, a learning ...
research
01/24/2019

General Supervision via Probabilistic Transformations

Different types of training data have led to numerous schemes for superv...
research
01/16/2020

Masking schemes for universal marginalisers

We consider the effect of structure-agnostic and structure-dependent mas...
