On Generalization and Regularization in Deep Learning

04/05/2017
by   Pirmin Lemberger, et al.
0

Why do large neural network generalize so well on complex tasks such as image classification or speech recognition? What exactly is the role regularization for them? These are arguably among the most important open questions in machine learning today. In a recent and thought provoking paper [C. Zhang et al.] several authors performed a number of numerical experiments that hint at the need for novel theoretical concepts to account for this phenomenon. The paper stirred quit a lot of excitement among the machine learning community but at the same time it created some confusion as discussions on OpenReview.net testifies. The aim of this pedagogical paper is to make this debate accessible to a wider audience of data scientists without advanced theoretical knowledge in statistical learning. The focus here is on explicit mathematical definitions and on a discussion of relevant concepts, not on proofs for which we provide references.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/09/2019

Challenging the Boundaries of Speech Recognition: The MALACH Corpus

There has been huge progress in speech recognition over the last several...
research
03/24/2018

Machine Learning and Applied Linguistics

This entry introduces the topic of machine learning and provides an over...
research
04/24/2019

Horseshoe Regularization for Machine Learning in Complex and Deep Models

Since the advent of the horseshoe priors for regularization, global-loca...
research
05/29/2021

Ten Quick Tips for Deep Learning in Biology

Machine learning is a modern approach to problem-solving and task automa...
research
05/26/2023

Sources of Uncertainty in Machine Learning – A Statisticians' View

Machine Learning and Deep Learning have achieved an impressive standard ...
research
01/10/2020

Data-Dependence of Plateau Phenomenon in Learning with Neural Network — Statistical Mechanical Analysis

The plateau phenomenon, wherein the loss value stops decreasing during t...
research
11/09/2022

Profiling and Improving the PyTorch Dataloader for high-latency Storage: A Technical Report

A growing number of Machine Learning Frameworks recently made Deep Learn...

Please sign up or login with your details

Forgot password? Click here to reset