Limits of End-to-End Learning

04/26/2017
by Tobias Glasmachers, et al.

End-to-end learning refers to training a possibly complex learning system by applying gradient-based learning to the system as a whole. An end-to-end learning system is specifically designed so that all of its modules are differentiable. In effect, not only the central learning machine but also all "peripheral" modules, such as representation learning and memory formation, are covered by a holistic learning process. The power of end-to-end learning has been demonstrated on many tasks, such as playing a whole array of Atari video games with a single architecture. As researchers push for solutions to more challenging tasks, network architectures keep growing more complex. In this paper we ask whether, and to what extent, end-to-end learning is a future-proof technique in the sense of scaling to complex and diverse data processing architectures. We point out potential inefficiencies, and we argue in particular that end-to-end learning does not make optimal use of the modular design of present neural networks. Our surprisingly simple experiments demonstrate these inefficiencies, up to the complete breakdown of learning.
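
To make the definition concrete, the following is a minimal sketch of end-to-end training on a toy two-module system. It is not taken from the paper: the scalar "encoder" and "head" modules, the names `forward`, `loss_and_grads`, and `train`, the target function, and all hyperparameters are assumptions chosen purely for illustration. The point it shows is that a single loss gradient is propagated through both differentiable modules, so they are updated jointly rather than trained separately.

```python
# Illustrative sketch only (not the paper's experiments).
# Hypothetical system: module 1 computes h = w1 * x, module 2 computes y = w2 * h.
# Both modules are differentiable, so one loss drives updates to both.

def forward(w1, w2, x):
    h = w1 * x          # module 1: "representation"
    y = w2 * h          # module 2: "prediction"
    return h, y

def loss_and_grads(w1, w2, data):
    """Mean squared error over data and its gradients w.r.t. w1 and w2."""
    n = len(data)
    loss = g1 = g2 = 0.0
    for x, t in data:
        h, y = forward(w1, w2, x)
        err = y - t
        loss += err * err / n
        g2 += 2.0 * err * h / n          # dL/dw2
        g1 += 2.0 * err * w2 * x / n     # dL/dw1, chain rule through module 2
    return loss, g1, g2

def train(data, w1=0.5, w2=0.5, lr=0.05, steps=200):
    """Plain gradient descent applied to the system as a whole."""
    history = []
    for _ in range(steps):
        loss, g1, g2 = loss_and_grads(w1, w2, data)
        history.append(loss)
        w1 -= lr * g1
        w2 -= lr * g2
    return w1, w2, history

# Assumed toy target: t = 3 * x, so the modules must jointly learn w1 * w2 = 3.
data = [(x, 3.0 * x) for x in (-1.0, -0.5, 0.5, 1.0)]
w1, w2, history = train(data)
```

In this sketch neither module receives its own training signal; only the product `w1 * w2` is constrained by the loss. That coupling is the kind of property the abstract has in mind when it asks whether end-to-end learning makes good use of modular structure.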


