Lockout: Sparse Regularization of Neural Networks

07/15/2021
by Gilmer Valdes, et al.

Many regression and classification procedures fit a parameterized function f(x; w) of predictor variables x to data {x_i, y_i}_{i=1}^N based on some loss criterion L(y, f). Regularization is often applied to improve accuracy by placing a constraint P(w) ≤ t on the values of the parameters w. Although efficient methods exist for finding solutions to these constrained optimization problems for all values of t ≥ 0 in the special case where f is a linear function, none are available when f is nonlinear (e.g., Neural Networks). Here we present a fast algorithm that provides all such solutions for any differentiable function f and loss L, and for any constraint P that is a monotone increasing function of the absolute value of each parameter. Applications involving sparsity-inducing regularization of arbitrary Neural Networks are discussed. Empirical results indicate that these sparse solutions are usually superior to their dense counterparts in both accuracy and interpretability. This improvement in accuracy can often make Neural Networks competitive with, and sometimes superior to, state-of-the-art methods in the analysis of tabular data.
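The Lockout path algorithm itself is not reproduced in this abstract, so the sketch below instead illustrates the penalized (Lagrangian) counterpart of the same problem: minimizing L(y, f(x; w)) + λ·P(w) with P(w) = Σ_j |w_j| taken over the first-layer weights, which induces input sparsity in a small network. The synthetic data, architecture, λ value, and selection threshold are illustrative assumptions, not the paper's setup or results.

```python
# Minimal sketch (assumptions: PyTorch, synthetic data, L1 as the constraint P).
# This is the penalized analogue of the constrained problem P(w) <= t,
# not the paper's Lockout path algorithm.
import torch
import torch.nn as nn

torch.manual_seed(0)

# Synthetic regression data: only the first 3 of 20 predictors matter.
N, p = 200, 20
X = torch.randn(N, p)
y = X[:, :3] @ torch.tensor([2.0, -1.0, 0.5]) + 0.1 * torch.randn(N)

model = nn.Sequential(nn.Linear(p, 16), nn.ReLU(), nn.Linear(16, 1))
opt = torch.optim.Adam(model.parameters(), lr=1e-2)
mse = nn.MSELoss()

lam = 1e-2  # illustrative penalty strength
for step in range(2000):
    opt.zero_grad()
    pred = model(X).squeeze(-1)
    # P(w): sum of absolute values of first-layer weights -> input sparsity.
    penalty = model[0].weight.abs().sum()
    loss = mse(pred, y) + lam * penalty
    loss.backward()
    opt.step()

# Plain (sub)gradient descent drives irrelevant weights near, not exactly to,
# zero, so inputs are selected by thresholding first-layer column norms.
col_norms = model[0].weight.abs().sum(dim=0)
selected = (col_norms > 1e-2 * col_norms.max()).nonzero().squeeze(-1)
print("inputs retained:", selected.tolist())
```

Sweeping λ from large to small traces a sequence of increasingly dense solutions, the penalized analogue of varying t in the constrained formulation that Lockout solves over its full range.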
