Towards Accurate Binary Neural Networks via Modeling Contextual Dependencies

09/03/2022
by   Xingrun Xing, et al.
0

Existing Binary Neural Networks (BNNs) mainly operate on local convolutions with binarization function. However, such simple bit operations lack the ability of modeling contextual dependencies, which is critical for learning discriminative deep representations in vision models. In this work, we tackle this issue by presenting new designs of binary neural modules, which enables BNNs to learn effective contextual dependencies. First, we propose a binary multi-layer perceptron (MLP) block as an alternative to binary convolution blocks to directly model contextual dependencies. Both short-range and long-range feature dependencies are modeled by binary MLPs, where the former provides local inductive bias and the latter breaks limited receptive field in binary convolutions. Second, to improve the robustness of binary models with contextual dependencies, we compute the contextual dynamic embeddings to determine the binarization thresholds in general binary convolutional blocks. Armed with our binary MLP blocks and improved binary convolution, we build the BNNs with explicit Contextual Dependency modeling, termed as BCDNet. On the standard ImageNet-1K classification benchmark, the BCDNet achieves 72.3 accuracy and outperforms leading binary methods by a large margin. In particular, the proposed BCDNet exceeds the state-of-the-art ReActNet-A by 2.9 Top-1 accuracy with similar operations. Our code is available at https://github.com/Sense-GVT/BCDN

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/16/2019

X-Net: Brain Stroke Lesion Segmentation Based on Depthwise Separable Convolution and Long-range Dependencies

The morbidity of brain stroke increased rapidly in the past few years. T...
research
04/08/2022

Contextual Representation Learning beyond Masked Language Modeling

How do masked language models (MLMs) such as BERT learn contextual repre...
research
01/24/2022

UniFormer: Unifying Convolution and Self-attention for Visual Recognition

It is a challenging task to learn discriminative representation from ima...
research
08/28/2019

A Global-Local Emebdding Module for Fashion Landmark Detection

Detecting fashion landmarks is a fundamental technique for visual clothi...
research
08/17/2021

Contextual Convolutional Neural Networks

We propose contextual convolution (CoConv) for visual recognition. CoCon...
research
12/29/2022

BiMLP: Compact Binary Architectures for Vision Multi-Layer Perceptrons

This paper studies the problem of designing compact binary architectures...
research
07/23/2019

Compact Global Descriptor for Neural Networks

Long-range dependencies modeling, widely used in capturing spatiotempora...

Please sign up or login with your details

Forgot password? Click here to reset