Multi-level Wavelet Convolutional Neural Networks

07/06/2019
by   Pengju Liu, et al.

In computer vision, convolutional neural networks (CNNs) often adopt pooling to enlarge the receptive field, which has the advantage of low computational complexity. However, pooling can cause information loss and is thus detrimental to subsequent operations such as feature extraction and analysis. Recently, dilated filtering has been proposed to trade off between receptive field size and efficiency, but the accompanying gridding effect can cause sparse sampling of input images with checkerboard patterns. To address this problem, we propose a novel multi-level wavelet CNN (MWCNN) model that achieves a better trade-off between receptive field size and computational efficiency. The core idea is to embed the wavelet transform into the CNN architecture to reduce the resolution of feature maps while at the same time increasing the receptive field. Specifically, MWCNN for image restoration is based on the U-Net architecture, and the inverse wavelet transform (IWT) is deployed to reconstruct the high-resolution (HR) feature maps. The proposed MWCNN can also be viewed as an improvement on dilated filtering and a generalization of average pooling, and can be applied not only to image restoration tasks but also to any CNN requiring a pooling operation. The experimental results demonstrate the effectiveness of the proposed MWCNN for tasks such as image denoising, single image super-resolution, JPEG image artifact removal and object classification.
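To make the core idea concrete, the following is a minimal NumPy sketch of a wavelet transform used as an invertible downsampling step. It assumes a one-level 2D Haar wavelet, which the abstract does not specify, and the function names `haar_dwt2` and `haar_iwt2` are hypothetical. It is an illustration under those assumptions, not the authors' implementation. The subband labels follow one common convention; others swap LH and HL.

```python
import numpy as np

def haar_dwt2(x):
    """One-level 2D Haar DWT (assumed wavelet) of an (H, W) array, H and W even.

    Returns four subbands (LL, LH, HL, HH), each of shape (H/2, W/2).
    """
    a = x[0::2, 0::2]  # top-left pixel of each 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    ll = (a + b + c + d) / 2.0  # low-pass in both directions
    lh = (a + b - c - d) / 2.0  # difference between rows
    hl = (a - b + c - d) / 2.0  # difference between columns
    hh = (a - b - c + d) / 2.0  # diagonal difference
    return ll, lh, hl, hh

def haar_iwt2(ll, lh, hl, hh):
    """Inverse of haar_dwt2: rebuilds the (2h, 2w) array from four subbands."""
    h, w = ll.shape
    x = np.empty((2 * h, 2 * w), dtype=ll.dtype)
    x[0::2, 0::2] = (ll + lh + hl + hh) / 2.0
    x[0::2, 1::2] = (ll + lh - hl - hh) / 2.0
    x[1::2, 0::2] = (ll - lh + hl - hh) / 2.0
    x[1::2, 1::2] = (ll - lh - hl + hh) / 2.0
    return x

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feature_map = rng.standard_normal((8, 8))

    subbands = haar_dwt2(feature_map)
    restored = haar_iwt2(*subbands)

    # Unlike pooling, the downsampling step loses no information.
    print("perfect reconstruction:", np.allclose(restored, feature_map))

    # The LL subband is 2x2 average pooling up to a constant factor of 2.
    avg_pool = feature_map.reshape(4, 2, 4, 2).mean(axis=(1, 3))
    print("LL / 2 equals average pooling:", np.allclose(subbands[0] / 2.0, avg_pool))
```

In this sketch the four half-resolution subbands together hold exactly the information in the input, which is why the inverse transform can restore the full-resolution map. Keeping only the LL subband reduces to scaled average pooling, which is one way to read the "generalization of average pooling" remark.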

research
05/18/2018

Multi-level Wavelet-CNN for Image Restoration

The tradeoff between receptive field size and efficiency is a crucial is...
research
08/06/2018

Detailed Dense Inference with Convolutional Neural Networks via Discrete Wavelet Transform

Dense pixelwise prediction such as semantic segmentation is an up-to-dat...
research
05/16/2023

Content-Adaptive Downsampling in Convolutional Neural Networks

Many convolutional neural networks (CNNs) rely on progressive downsampli...
research
10/31/2019

Multi-scale Octave Convolutions for Robust Speech Recognition

We propose a multi-scale octave convolution layer to learn robust speech...
research
04/29/2021

Condensation-Net: Memory-Efficient Network Architecture with Cross-Channel Pooling Layers and Virtual Feature Maps

"Lightweight convolutional neural networks" is an important research top...
research
09/27/2022

CEC-CNN: A Consecutive Expansion-Contraction Convolutional Network for Very Small Resolution Medical Image Classification

Deep Convolutional Neural Networks (CNNs) for image classification succe...
research
08/25/2022

Riesz-Quincunx-UNet Variational Auto-Encoder for Satellite Image Denoising

Multiresolution deep learning approaches, such as the U-Net architecture...
