Towards Accurate Binary Neural Networks via Modeling Contextual Dependencies

09/03/2022

∙

Existing Binary Neural Networks (BNNs) mainly operate on local convolutions with binarization function. However, such simple bit operations lack the ability of modeling contextual dependencies, which is critical for learning discriminative deep representations in vision models. In this work, we tackle this issue by presenting new designs of binary neural modules, which enables BNNs to learn effective contextual dependencies. First, we propose a binary multi-layer perceptron (MLP) block as an alternative to binary convolution blocks to directly model contextual dependencies. Both short-range and long-range feature dependencies are modeled by binary MLPs, where the former provides local inductive bias and the latter breaks limited receptive field in binary convolutions. Second, to improve the robustness of binary models with contextual dependencies, we compute the contextual dynamic embeddings to determine the binarization thresholds in general binary convolutional blocks. Armed with our binary MLP blocks and improved binary convolution, we build the BNNs with explicit Contextual Dependency modeling, termed as BCDNet. On the standard ImageNet-1K classification benchmark, the BCDNet achieves 72.3 accuracy and outperforms leading binary methods by a large margin. In particular, the proposed BCDNet exceeds the state-of-the-art ReActNet-A by 2.9 Top-1 accuracy with similar operations. Our code is available at https://github.com/Sense-GVT/BCDN

READ FULL TEXT

Towards Accurate Binary Neural Networks via Modeling Contextual Dependencies

X-Net: Brain Stroke Lesion Segmentation Based on Depthwise Separable Convolution and Long-range Dependencies

Contextual Representation Learning beyond Masked Language Modeling

UniFormer: Unifying Convolution and Self-attention for Visual Recognition

A Global-Local Emebdding Module for Fashion Landmark Detection

Contextual Convolutional Neural Networks

BiMLP: Compact Binary Architectures for Vision Multi-Layer Perceptrons

Compact Global Descriptor for Neural Networks

Towards Accurate Binary Neural Networks via Modeling Contextual Dependencies

Related Research

X-Net: Brain Stroke Lesion Segmentation Based on Depthwise Separable Convolution and Long-range Dependencies

Contextual Representation Learning beyond Masked Language Modeling

UniFormer: Unifying Convolution and Self-attention for Visual Recognition

A Global-Local Emebdding Module for Fashion Landmark Detection

Contextual Convolutional Neural Networks

BiMLP: Compact Binary Architectures for Vision Multi-Layer Perceptrons

Compact Global Descriptor for Neural Networks