Online Convolutional Re-parameterization

04/02/2022
by Mu Hu, et al.

Structural re-parameterization has drawn increasing attention in various computer vision tasks. It aims to improve the performance of deep models without introducing any inference-time cost. Though efficient during inference, such models rely heavily on complicated training-time blocks to achieve high accuracy, which leads to a large extra training cost. In this paper, we present online convolutional re-parameterization (OREPA), a two-stage pipeline that reduces this training overhead by squeezing the complex training-time block into a single convolution. To achieve this goal, we introduce a linear scaling layer for better optimizing the online blocks. With the reduced training cost, we also explore some more effective re-param components. Compared with state-of-the-art re-param models, OREPA saves about 70% of the training-time memory cost and accelerates training by around 2x. Meanwhile, equipped with OREPA, the models outperform previous methods on ImageNet by up to +0.6%. We also conduct experiments on object detection and semantic segmentation and show consistent improvements on the downstream tasks. Code is available at https://github.com/JUGGHM/OREPA_CVPR2022 .
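The idea of squeezing a multi-branch block into a single convolution rests on the linearity of convolution. The following is a minimal sketch of that idea, not the authors' implementation: the abstract does not specify the block's branches or the form of the scaling layer, so the two branches (a 3x3 and a 1x1 kernel on a single-channel input), the scalar per-branch scales, and all function names here are assumptions chosen for illustration.

```python
import random

def conv2d(x, k):
    """Zero-padded 'same' cross-correlation of a single-channel image x
    with an odd-sized square kernel k (both given as nested lists)."""
    h, w = len(x), len(x[0])
    r = len(k) // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for di in range(-r, r + 1):
                for dj in range(-r, r + 1):
                    ii, jj = i + di, j + dj
                    if 0 <= ii < h and 0 <= jj < w:
                        acc += x[ii][jj] * k[di + r][dj + r]
            out[i][j] = acc
    return out

def pad_1x1_to_3x3(k1):
    """Embed a 1x1 kernel at the centre of a zero 3x3 kernel."""
    v = k1[0][0]
    return [[0.0, 0.0, 0.0], [0.0, v, 0.0], [0.0, 0.0, 0.0]]

def merge_branches(k3, k1, s3, s1):
    """Fold both linearly scaled branches into one 3x3 kernel (assumed layout)."""
    k1p = pad_1x1_to_3x3(k1)
    return [[s3 * k3[i][j] + s1 * k1p[i][j] for j in range(3)] for i in range(3)]

def multi_branch_forward(x, k3, k1, s3, s1):
    """Reference path: run each branch separately, scale, then sum."""
    y3 = conv2d(x, k3)
    y1 = conv2d(x, k1)
    return [[s3 * a + s1 * b for a, b in zip(r3, r1)] for r3, r1 in zip(y3, y1)]

def max_abs_diff(a, b):
    return max(abs(p - q) for ra, rb in zip(a, b) for p, q in zip(ra, rb))

random.seed(0)
x = [[random.uniform(-1, 1) for _ in range(6)] for _ in range(5)]
k3 = [[random.uniform(-1, 1) for _ in range(3)] for _ in range(3)]
k1 = [[random.uniform(-1, 1)]]
s3, s1 = 0.7, -1.3  # hypothetical scaling factors, one per branch

merged = merge_branches(k3, k1, s3, s1)
diff = max_abs_diff(multi_branch_forward(x, k3, k1, s3, s1), conv2d(x, merged))
print("max |multi-branch - merged| =", diff)
```

Because every operation in the block is linear, the merged kernel reproduces the multi-branch output up to floating-point rounding, so only one convolution needs to be evaluated per forward pass. A nonlinear per-branch layer would break this equivalence.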

