Dual Complementary Dynamic Convolution for Image Recognition

11/11/2022
by   Longbin Yan, et al.
0

As a powerful engine, vanilla convolution has promoted huge breakthroughs in various computer tasks. However, it often suffers from sample and content agnostic problems, which limits the representation capacities of the convolutional neural networks (CNNs). In this paper, we for the first time model the scene features as a combination of the local spatial-adaptive parts owned by the individual and the global shift-invariant parts shared to all individuals, and then propose a novel two-branch dual complementary dynamic convolution (DCDC) operator to flexibly deal with these two types of features. The DCDC operator overcomes the limitations of vanilla convolution and most existing dynamic convolutions who capture only spatial-adaptive features, and thus markedly boosts the representation capacities of CNNs. Experiments show that the DCDC operator based ResNets (DCDC-ResNets) significantly outperform vanilla ResNets and most state-of-the-art dynamic convolutional networks on image classification, as well as downstream tasks including object detection, instance and panoptic segmentation tasks, while with lower FLOPs and parameters.

READ FULL TEXT

page 2

page 4

page 7

page 8

page 10

research
06/06/2023

CiT-Net: Convolutional Neural Networks Hand in Hand with Vision Transformers for Medical Image Segmentation

The hybrid architecture of convolutional neural networks (CNNs) and Tran...
research
04/22/2020

DyNet: Dynamic Convolution for Accelerating Convolutional Neural Networks

Convolution operator is the core of convolutional neural networks (CNNs)...
research
09/16/2022

Omni-Dimensional Dynamic Convolution

Learning a single static convolutional kernel in each convolutional laye...
research
08/16/2018

Anatomy Of High-Performance Deep Learning Convolutions On SIMD Architectures

Convolution layers are prevalent in many classes of deep neural networks...
research
04/22/2021

Heterogeneous Grid Convolution for Adaptive, Efficient, and Controllable Computation

This paper proposes a novel heterogeneous grid convolution that builds a...
research
12/02/2021

Object-aware Monocular Depth Prediction with Instance Convolutions

With the advent of deep learning, estimating depth from a single RGB ima...
research
05/29/2019

Attention Based Pruning for Shift Networks

In many application domains such as computer vision, Convolutional Layer...

Please sign up or login with your details

Forgot password? Click here to reset