GaterNet: Dynamic Filter Selection in Convolutional Neural Network via a Dedicated Global Gating Network

11/27/2018
by   Zhourong Chen, et al.

Conditional computation for deep nets has been proposed previously as a way to improve model performance by selectively using only parts of the model, conditioned on the sample being processed. In this paper, we investigate input-dependent dynamic filter selection in deep convolutional neural networks (CNNs). The problem is interesting because forcing different parts of the model to learn from different types of samples may help us acquire better filters in CNNs, improve model generalization, and potentially increase the interpretability of model behavior. We propose a novel yet simple framework called GaterNet, which involves a backbone network and a gater network. The backbone network is a regular CNN that performs the major computation needed for making a prediction, while a global gater network is introduced to generate binary gates that selectively activate filters in the backbone network based on each input. Extensive experiments on the CIFAR and ImageNet datasets show that our models consistently outperform the original models by a large margin. On CIFAR-10, our model also improves upon state-of-the-art results.
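
The backbone-plus-gater idea can be illustrated with a toy sketch. This is not the authors' implementation: the function names (`gater`, `gated_backbone_layer`, `conv2d_valid`), the use of a single global mean as the gater's feature, the hard threshold, and all weights are assumptions chosen only to show how a per-input binary gate vector can switch individual filters on or off. How the binary gates are trained is not covered here.

```python
from typing import List

Matrix = List[List[float]]


def conv2d_valid(image: Matrix, kernel: Matrix) -> Matrix:
    """Plain 2D cross-correlation with no padding and stride 1."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(
                image[i + di][j + dj] * kernel[di][dj]
                for di in range(kh)
                for dj in range(kw)
            )
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def gater(image: Matrix, weights: List[float], biases: List[float]) -> List[int]:
    """Hypothetical global gater: one binary gate per backbone filter.

    Assumption: the gater summarizes the whole input with its mean value,
    applies one linear unit per filter, and thresholds at zero.
    """
    n_pixels = len(image) * len(image[0])
    mean = sum(sum(row) for row in image) / n_pixels
    return [1 if w * mean + b > 0 else 0 for w, b in zip(weights, biases)]


def gated_backbone_layer(
    image: Matrix, kernels: List[Matrix], gates: List[int]
) -> List[Matrix]:
    """One backbone conv layer (with ReLU) whose filters are masked by the gates."""
    outputs = []
    for kernel, gate in zip(kernels, gates):
        fmap = conv2d_valid(image, kernel)
        # A gate of 0 zeroes the whole feature map of that filter.
        outputs.append([[gate * max(v, 0.0) for v in row] for row in fmap])
    return outputs


# Illustrative, hand-picked parameters (assumptions, not learned values).
KERNELS = [[[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]]]
GATE_WEIGHTS = [1.0, -1.0]
GATE_BIASES = [-0.5, 0.5]

bright = [[1.0] * 3 for _ in range(3)]
dark = [[0.0] * 3 for _ in range(3)]

bright_gates = gater(bright, GATE_WEIGHTS, GATE_BIASES)  # [1, 0]
dark_gates = gater(dark, GATE_WEIGHTS, GATE_BIASES)      # [0, 1]
bright_out = gated_backbone_layer(bright, KERNELS, bright_gates)
```

In this toy setup the two inputs activate different filters, which is the input-dependent filter selection the abstract describes. A real gater would be a learned network, and the gates would be applied across many layers of the backbone.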

research
10/14/2022

Neural Routing in Meta Learning

Meta-learning often referred to as learning-to-learn is a promising noti...
research
11/20/2019

DRNet: Dissect and Reconstruct the Convolutional Neural Network via Interpretable Manners

This paper proposes to use an interpretable method to dissect the channe...
research
04/22/2019

Inner-Imaging Convolutional Networks

Despite the tremendous success in computer vision, deep convolutional ne...
research
06/16/2023

Towards Better Orthogonality Regularization with Disentangled Norm in Training Deep CNNs

Orthogonality regularization has been developed to prevent deep CNNs fro...
research
09/25/2017

Adaptive Convolutional Filter Generation for Natural Language Understanding

Convolutional neural networks (CNNs) have recently emerged as a popular ...
research
12/01/2019

Not All Attention Is Needed: Gated Attention Network for Sequence Data

Although deep neural networks generally have fixed network structures, t...
research
10/28/2022

LOFT: Finding Lottery Tickets through Filter-wise Training

Recent work on the Lottery Ticket Hypothesis (LTH) shows that there exis...