Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks

10/05/2020
by   Róbert Csordás, et al.
41

Neural networks (NNs) whose subnetworks implement reusable functions are expected to offer numerous advantages, including compositionality through efficient recombination of functional building blocks, interpretability, preventing catastrophic interference, etc. Understanding if and how NNs are modular could provide insights into how to improve them. Current inspection methods, however, fail to link modules to their functionality. In this paper, we present a novel method based on learning binary weight masks to identify individual weights and subnets responsible for specific functions. Using this powerful tool, we contribute an extensive study of emerging modularity in NNs that covers several standard architectures and datasets. We demonstrate how common NNs fail to reuse submodules and offer new insights into the related issue of systematic generalization on language tasks.

READ FULL TEXT

page 18

page 19

page 23

page 25

page 26

page 27

research
12/25/2021

Neural Network Module Decomposition and Recomposition

We propose a modularization method that decomposes a deep neural network...
research
08/26/2023

Differentiable Weight Masks for Domain Transfer

One of the major drawbacks of deep learning models for computer vision h...
research
02/27/2023

Permutation Equivariant Neural Functionals

This work studies the design of neural networks that can process the wei...
research
05/04/2023

Seeing is Believing: Brain-Inspired Modular Training for Mechanistic Interpretability

We introduce Brain-Inspired Modular Training (BIMT), a method for making...
research
05/31/2022

Differentiable programming for functional connectomics

Mapping the functional connectome has the potential to uncover key insig...
research
05/22/2023

Neural Functional Transformers

The recent success of neural networks as implicit representation of data...
research
06/02/2023

Modularity based linkage model for neuroevolution

Crossover between neural networks is considered disruptive due to the st...

Please sign up or login with your details

Forgot password? Click here to reset