IrEne: Interpretable Energy Prediction for Transformers

06/02/2021
by Qingqing Cao, et al.

Existing software-based energy measurements of NLP models are not accurate because they do not consider the complex interactions between energy consumption and model execution. We present IrEne, an interpretable and extensible energy prediction system that accurately predicts the inference energy consumption of a wide range of Transformer-based NLP models. IrEne constructs a model tree graph that breaks down the NLP model into modules, which are further broken down into low-level machine learning (ML) primitives. IrEne predicts the inference energy consumption of the ML primitives as a function of generalizable features and fine-grained runtime resource usage. It then aggregates these low-level predictions recursively to predict the energy of each module and, finally, of the entire model. Experiments across multiple Transformer models show that IrEne predicts their inference energy consumption with an error of under 7%. In contrast, existing energy models see an error of over 50%. IrEne can be used to conduct energy bottleneck analysis and to easily evaluate the energy impact of different architectural choices. We release the code and data at https://github.com/StonyBrookNLP/irene.
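
The recursive aggregation over the model tree can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not IrEne's actual code: the `Node` class, the `predict_energy` function, the feature names `flops` and `mem_bytes`, and the toy linear primitive model are all hypothetical. The abstract does not specify how child predictions are combined, so a plain sum is used here as a stand-in.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Node:
    """Hypothetical model-tree node: a module (has children) or an ML primitive (leaf)."""
    name: str
    features: Dict[str, float] = field(default_factory=dict)  # only read for leaves
    children: List["Node"] = field(default_factory=list)


def predict_energy(node: Node,
                   primitive_model: Callable[[Dict[str, float]], float]) -> float:
    """Predict energy for a node by recursing down to the ML primitives.

    Leaves are scored by `primitive_model`; internal nodes combine their
    children's predictions. The plain sum is an assumption of this sketch.
    """
    if not node.children:
        return primitive_model(node.features)
    return sum(predict_energy(child, primitive_model) for child in node.children)


def toy_primitive_model(features: Dict[str, float]) -> float:
    """Toy stand-in for a learned per-primitive regressor (made-up coefficients)."""
    return 2.0 * features.get("flops", 0.0) + 0.5 * features.get("mem_bytes", 0.0)


# A tiny, made-up tree: model -> modules -> ML primitives.
model = Node("model", children=[
    Node("attention", children=[
        Node("matmul", {"flops": 4.0, "mem_bytes": 2.0}),
        Node("softmax", {"flops": 1.0, "mem_bytes": 2.0}),
    ]),
    Node("feed_forward", children=[
        Node("linear", {"flops": 8.0, "mem_bytes": 4.0}),
    ]),
])

total = predict_energy(model, toy_primitive_model)
per_module = {m.name: predict_energy(m, toy_primitive_model) for m in model.children}
```

Because every internal node has its own prediction, comparing the entries of `per_module` is one simple way to see which module dominates the total, which is the kind of bottleneck analysis the abstract mentions.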


