BiMLP: Compact Binary Architectures for Vision Multi-Layer Perceptrons

12/29/2022
by   Yixing Xu, et al.
0

This paper studies the problem of designing compact binary architectures for vision multi-layer perceptrons (MLPs). We provide extensive analysis on the difficulty of binarizing vision MLPs and find that previous binarization methods perform poorly due to limited capacity of binary MLPs. In contrast with the traditional CNNs that utilizing convolutional operations with large kernel size, fully-connected (FC) layers in MLPs can be treated as convolutional layers with kernel size 1×1. Thus, the representation ability of the FC layers will be limited when being binarized, and places restrictions on the capability of spatial mixing and channel mixing on the intermediate features. To this end, we propose to improve the performance of binary MLP (BiMLP) model by enriching the representation ability of binary FC layers. We design a novel binary block that contains multiple branches to merge a series of outputs from the same stage, and also a universal shortcut connection that encourages the information flow from the previous stage. The downsampling layers are also carefully designed to reduce the computational complexity while maintaining the classification performance. Experimental results on benchmark dataset ImageNet-1k demonstrate the effectiveness of the proposed BiMLP models, which achieve state-of-the-art accuracy compared to prior binary CNNs. The MindSpore code is available at <https://gitee.com/mindspore/models/tree/master/research/cv/BiMLP>.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/29/2023

BinaryViT: Pushing Binary Vision Transformers Towards Convolutional Models

With the increasing popularity and the increasing size of vision transfo...
research
05/05/2021

RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition

We propose RepMLP, a multi-layer-perceptron-style neural network buildin...
research
03/27/2023

Architecturing Binarized Neural Networks for Traffic Sign Recognition

Traffic signs support road safety and managing the flow of traffic, henc...
research
06/03/2019

Deeply-supervised Knowledge Synergy

Convolutional Neural Networks (CNNs) have become deeper and more complic...
research
09/03/2022

Towards Accurate Binary Neural Networks via Modeling Contextual Dependencies

Existing Binary Neural Networks (BNNs) mainly operate on local convoluti...
research
12/21/2021

RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality

Compared to convolutional layers, fully-connected (FC) layers are better...
research
08/25/2023

Squeeze aggregated excitation network

Convolutional neural networks have spatial representations which read pa...

Please sign up or login with your details

Forgot password? Click here to reset