Energy and Policy Considerations for Deep Learning in NLP

06/05/2019
by   Emma Strubell, et al.
0

Recent progress in hardware and methodology for training neural networks has ushered in a new generation of large networks trained on abundant data. These models have obtained notable gains in accuracy across many NLP tasks. However, these accuracy improvements depend on the availability of exceptionally large computational resources that necessitate similarly substantial energy consumption. As a result these models are costly to train and develop, both financially, due to the cost of hardware and electricity or cloud compute time, and environmentally, due to the carbon footprint required to fuel modern tensor processing hardware. In this paper we bring this issue to the attention of NLP researchers by quantifying the approximate financial and environmental costs of training a variety of recently successful neural network models for NLP. Based on these findings, we propose actionable recommendations to reduce costs and improve equity in NLP research and practice.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/03/2022

Maintaining Performance with Less Data

We propose a novel method for training a neural network for image classi...
research
07/08/2023

Towards Efficient In-memory Computing Hardware for Quantized Neural Networks: State-of-the-art, Open Challenges and Perspectives

The amount of data processed in the cloud, the development of Internet-o...
research
06/20/2023

Deep Fusion: Efficient Network Training via Pre-trained Initializations

In recent years, deep learning has made remarkable progress in a wide ra...
research
02/02/2023

Energy Efficiency of Training Neural Network Architectures: An Empirical Study

The evaluation of Deep Learning models has traditionally focused on crit...
research
04/26/2023

Tensor Decomposition for Model Reduction in Neural Networks: A Review

Modern neural networks have revolutionized the fields of computer vision...
research
04/21/2021

Carbon Emissions and Large Neural Network Training

The computation demand for machine learning (ML) has grown rapidly recen...
research
12/13/2021

Do Data-based Curricula Work?

Current state-of-the-art NLP systems use large neural networks that requ...

Please sign up or login with your details

Forgot password? Click here to reset