Post Training in Deep Learning with Last Kernel

11/14/2016
by Thomas Moreau, et al.

One of the main challenges of deep learning methods is the choice of an appropriate training strategy. In particular, additional steps, such as unsupervised pre-training, have been shown to greatly improve the performance of deep structures. In this article, we propose an extra training step, called post-training, which optimizes only the last layer of the network. We show that this procedure can be analyzed in the context of kernel theory: the first layers compute an embedding of the data, and the last layer acts as a statistical model that solves the task based on this embedding. This step ensures that the embedding, or representation, of the data is used in the best possible way for the considered task. The idea is then tested on multiple architectures with various data sets, showing that it consistently provides a boost in performance.
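To make the idea of optimizing only the last layer concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the toy regression data, the random frozen weights standing in for already-trained first layers, the squared loss with an L2 penalty, and every name (`embed`, `post_train`, `hidden_weights`, `last_weights`) are assumptions made for illustration. The embedding is computed once and never updated; only the last-layer weights change.

```python
import math
import random


def embed(x, hidden_weights):
    """Frozen first layers: map an input vector to its embedding."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in hidden_weights]


def predict(features, last_weights):
    """Last layer: a linear model on top of the embedding."""
    return sum(w * f for w, f in zip(last_weights, features))


def mean_squared_error(feature_rows, targets, last_weights):
    errors = [predict(f, last_weights) - t for f, t in zip(feature_rows, targets)]
    return sum(e * e for e in errors) / len(errors)


def post_train(feature_rows, targets, last_weights, lr=0.05, l2=1e-3, steps=500):
    """Gradient descent on the last layer only (assumed squared loss + L2 penalty)."""
    w = list(last_weights)
    n = len(feature_rows)
    for _ in range(steps):
        grad = [l2 * wj for wj in w]  # gradient of (l2 / 2) * ||w||^2
        for f, t in zip(feature_rows, targets):
            err = predict(f, w) - t
            for j, fj in enumerate(f):
                grad[j] += 2.0 * err * fj / n  # gradient of the mean squared error
        w = [wj - lr * gj for wj, gj in zip(w, grad)]
    return w


# Hypothetical toy setup: 50 samples, 3 inputs, 8 hidden units.
random.seed(0)
inputs = [[random.uniform(-1, 1) for _ in range(3)] for _ in range(50)]
targets = [math.sin(x[0]) + 0.5 * x[1] for x in inputs]
hidden_weights = [[random.gauss(0, 1) for _ in range(3)] for _ in range(8)]
last_weights = [random.gauss(0, 1) for _ in range(8)]

# The embedding is fixed, so it can be computed once before post-training.
features = [embed(x, hidden_weights) for x in inputs]
loss_before = mean_squared_error(features, targets, last_weights)
post_trained_weights = post_train(features, targets, last_weights)
loss_after = mean_squared_error(features, targets, post_trained_weights)
print(f"loss before: {loss_before:.4f}, loss after: {loss_after:.4f}")
```

Because the embedding is fixed, the problem solved in `post_train` is convex in the last-layer weights for this choice of loss, which is what makes a separate final step cheap compared with training the whole network.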
