Sparsified Model Zoo Twins: Investigating Populations of Sparsified Neural Network Models

04/26/2023
by   Dominik Honegger, et al.
6

With growing size of Neural Networks (NNs), model sparsification to reduce the computational cost and memory demand for model inference has become of vital interest for both research and production. While many sparsification methods have been proposed and successfully applied on individual models, to the best of our knowledge their behavior and robustness has not yet been studied on large populations of models. With this paper, we address that gap by applying two popular sparsification methods on populations of models (so called model zoos) to create sparsified versions of the original zoos. We investigate the performance of these two methods for each zoo, compare sparsification layer-wise, and analyse agreement between original and sparsified populations. We find both methods to be very robust with magnitude pruning able outperform variational dropout with the exception of high sparsification ratios above 80 Further, we find sparsified models agree to a high degree with their original non-sparsified counterpart, and that the performance of original and sparsified model is highly correlated. Finally, all models of the model zoos and their sparsified model twins are publicly available: modelzoos.cc.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/25/2019

Survey of Dropout Methods for Deep Neural Networks

Dropout methods are a family of stochastic techniques used in neural net...
research
09/29/2022

Model Zoos: A Dataset of Diverse Populations of Neural Network Models

In the last years, neural networks (NN) have evolved from laboratory env...
research
04/02/2022

Paoding: Supervised Robustness-preserving Data-free Neural Network Pruning

When deploying pre-trained neural network models in real-world applicati...
research
09/22/2022

EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models

Neural models are known to be over-parameterized, and recent work has sh...
research
01/01/2018

Robust comparisons of variation using ratios of interquantile ranges

There are two major shortcomings of the F-test for testing the equality ...
research
06/29/2020

The Heterogeneity Hypothesis: Finding Layer-Wise Dissimilated Network Architecture

In this paper, we tackle the problem of convolutional neural network des...
research
10/28/2021

Exoplanet atmosphere evolution: emulation with random forests

Atmospheric mass-loss is known to play a leading role in sculpting the d...

Please sign up or login with your details

Forgot password? Click here to reset