Interpretable Neuroevolutionary Models for Learning Non-Differentiable Functions and Programs

07/16/2020
by   Allan Costa, et al.
50

A key factor in the modern success of deep learning is the astonishing expressive power of neural networks. However, this comes at the cost of complex, black-boxed models that are unable to extrapolate beyond the domain of the training dataset, conflicting with goals of expressing physical laws or building human-readable programs. In this paper, we introduce OccamNet, a neural network model that can find interpretable, compact and sparse solutions for fitting data, à la Occam's razor. Our model defines a probability distribution over a non-differentiable function space, and we introduce an optimization method that samples functions and updates the weights based on cross-entropy matching in an evolutionary strategy: we train by biasing the probability mass towards better fitting solutions. We demonstrate that we can fit a variety of algorithms, ranging from simple analytic functions through recursive programs to even simple image classification. Our method takes minimal memory footprint, does not require AI accelerators for efficient training, fits complicated functions in minutes of training on a single CPU, and demonstrates significant performance gains when scaled on GPU. Our implementation, demonstrations and instructions for reproducing the experiments are available at https://github.com/AllanSCosta/occam-net.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/09/2019

Differentiable Approximation Bridges For Training Networks Containing Non-Differentiable Functions

Modern neural network training relies on piece-wise (sub-)differentiable...
research
10/04/2021

Effectiveness of Optimization Algorithms in Deep Image Classification

Adam is applied widely to train neural networks. Different kinds of Adam...
research
02/10/2022

EvoJAX: Hardware-Accelerated Neuroevolution

Evolutionary computation has been shown to be a highly effective method ...
research
08/07/2023

Distributionally Robust Classification on a Data Budget

Real world uses of deep learning require predictable model behavior unde...
research
05/09/2023

Recursions Are All You Need: Towards Efficient Deep Unfolding Networks

The use of deep unfolding networks in compressive sensing (CS) has seen ...
research
12/02/2007

Summarization and Classification of Non-Poisson Point Processes

Fitting models for non-Poisson point processes is complicated by the lac...
research
12/06/2021

Interpretable Image Classification with Differentiable Prototypes Assignment

We introduce ProtoPool, an interpretable image classification model with...

Please sign up or login with your details

Forgot password? Click here to reset