Deep Fusion: Efficient Network Training via Pre-trained Initializations

06/20/2023
by   Hanna Mazzawi, et al.
0

In recent years, deep learning has made remarkable progress in a wide range of domains, with a particularly notable impact on natural language processing tasks. One of the challenges associated with training deep neural networks is the need for large amounts of computational resources and time. In this paper, we present Deep Fusion, an efficient approach to network training that leverages pre-trained initializations of smaller networks. Fusion accelerates the training process, reduces computational requirements, and leads to improved generalization performance on a variety of NLP tasks and T5 model sizes. and effective approach to reduce the training time and resource consumption while maintaining, or even surpassing, the performance of traditional training methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/30/2022

TextPruner: A Model Pruning Toolkit for Pre-Trained Language Models

Pre-trained language models have been prevailed in natural language proc...
research
11/10/2015

Reducing the Training Time of Neural Networks by Partitioning

This paper presents a new method for pre-training neural networks that c...
research
11/21/2018

Dynamic-Net: Tuning the Objective Without Re-training

One of the key ingredients for successful optimization of modern CNNs is...
research
05/24/2022

DNNAbacus: Toward Accurate Computational Cost Prediction for Deep Neural Networks

Deep learning is attracting interest across a variety of domains, includ...
research
11/19/2021

Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari

Across the Arcade Learning Environment, Rainbow achieves a level of perf...
research
10/27/2019

Deep Learning for Plasma Tomography and Disruption Prediction from Bolometer Data

The use of deep learning is facilitating a wide range of data processing...
research
06/05/2019

Energy and Policy Considerations for Deep Learning in NLP

Recent progress in hardware and methodology for training neural networks...

Please sign up or login with your details

Forgot password? Click here to reset