Res2Net: A New Multi-scale Backbone Architecture

04/02/2019
by   Shang-Hua Gao, et al.
0

Representing features at multiple scales is of great importance for numerous vision tasks. Recent advances in backbone convolutional neural networks (CNNs) continually demonstrate stronger multi-scale representation ability, leading to consistent performance gains on a wide range of applications. However, most existing methods represent the multi-scale features in a layer-wise manner. In this paper, we propose a novel building block for CNNs, namely Res2Net, by constructing hierarchical residual-like connections within one single residual block. The Res2Net represents multi-scale features at a granular level and increases the range of receptive fields for each network layer. The proposed Res2Net block can be plugged into the state-of-the-art backbone CNN models, e.g., ResNet, ResNeXt, and DLA. We evaluate the Res2Net block on all these models and demonstrate consistent performance gains over baseline models on widely-used datasets, e.g., CIFAR-100 and ImageNet. Further ablation studies and experimental results on representative computer vision tasks, i.e., object detection, class activation mapping, and salient object detection, further verify the superiority of the Res2Net over the state-of-the-art baseline methods. The source code and trained models will be made publicly available.

READ FULL TEXT

page 1

page 6

page 7

research
10/15/2020

HS-ResNet: Hierarchical-Split Block on Convolutional Neural Network

This paper addresses representational block named Hierarchical-Split Blo...
research
01/01/2023

GoogLe2Net: Going Transverse with Convolutions

Capturing feature information effectively is of great importance in visi...
research
10/28/2022

Grafting Vision Transformers

Vision Transformers (ViTs) have recently become the state-of-the-art acr...
research
12/21/2020

Centralized Information Interaction for Salient Object Detection

The U-shape structure has shown its advantage in salient object detectio...
research
06/11/2019

FAMED-Net: A Fast and Accurate Multi-scale End-to-end Dehazing Network

Single image dehazing is a critical image pre-processing step for subseq...
research
09/25/2021

TreeNet: A lightweight One-Shot Aggregation Convolutional Network

The architecture of deep convolutional networks (CNNs) has evolved for y...
research
04/20/2019

Data-Driven Neuron Allocation for Scale Aggregation Networks

Successful visual recognition networks benefit from aggregating informat...

Please sign up or login with your details

Forgot password? Click here to reset