An Empirical Deep Dive into Deep Learning's Driving Dynamics

07/25/2022
by   Charles Edison Tripp, et al.
4

We present an empirical dataset surveying the deep learning phenomenon on fully-connected networks, encompassing the training and test performance of numerous network topologies, sweeping across multiple learning tasks, depths, numbers of free parameters, learning rates, batch sizes, and regularization penalties. The dataset probes 178 thousand hyperparameter settings with an average of 20 repetitions each, totaling 3.5 million training runs and 20 performance metrics for each of the 13.1 billion training epochs observed. Accumulating this 671 GB dataset utilized 5,448 CPU core-years, 17.8 GPU-years, and 111.2 node-years. Additionally, we provide a preliminary analysis revealing patterns which persist across learning tasks and topologies. We aim to inspire work empirically studying modern machine learning techniques as a catalyst for the theoretical discoveries needed to progress the field beyond energy-intensive and heuristic practices.

READ FULL TEXT

page 14

page 16

page 18

page 19

page 20

page 22

page 26

page 30

research
02/16/2019

Communication Topologies Between Learning Agents in Deep Reinforcement Learning

A common technique to improve speed and robustness of learning in deep r...
research
11/30/2018

How to Organize your Deep Reinforcement Learning Agents: The Importance of Communication Topology

In this empirical paper, we investigate how learning agents can be arran...
research
11/30/2017

Improved Learning in Evolution Strategies via Sparser Inter-Agent Network Topologies

We draw upon a previously largely untapped literature on human collectiv...
research
07/24/2019

Benchmarking TPU, GPU, and CPU Platforms for Deep Learning

Training deep learning models is compute-intensive and there is an indus...
research
01/04/2022

Optimal circulant graphs as low-latency network topologies

Communication latency has become one of the determining factors for the ...
research
06/15/2020

On the training dynamics of deep networks with L_2 regularization

We study the role of L_2 regularization in deep learning, and uncover si...
research
03/26/2019

Netherlands Dataset: A New Public Dataset for Machine Learning in Seismic Interpretation

Machine learning and, more specifically, deep learning algorithms have s...

Please sign up or login with your details

Forgot password? Click here to reset