WavePaint: Resource-efficient Token-mixer for Self-supervised Inpainting

07/01/2023
by Pranav Jeevan, et al.

Image inpainting, the synthesis of missing regions in an image, can help restore occluded or degraded areas and can also serve as a precursor task for self-supervision. Current state-of-the-art inpainting models are computationally heavy because they are based on transformer or CNN backbones trained in adversarial or diffusion settings. This paper diverges from vision transformers by using a computationally efficient, WaveMix-based fully convolutional architecture, WavePaint. It uses a 2D discrete wavelet transform (DWT) for spatial and multi-resolution token-mixing along with convolutional layers. The proposed model outperforms the current state-of-the-art models for image inpainting on reconstruction quality while using less than half the parameter count and considerably lower training and evaluation times. Our model even outperforms current GAN-based architectures on the CelebA-HQ dataset without using an adversarially trainable discriminator. Our work suggests that neural architectures modeled after natural image priors require fewer parameters and computations to achieve generalization comparable to transformers.
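
To make the DWT-based token-mixing idea concrete, here is a minimal NumPy sketch of a single-level 2D Haar DWT. It is an illustration only, not the WavePaint implementation: the choice of the Haar wavelet, the function names haar_dwt2, haar_idwt2, and dwt_token_mix, and the stacking of the four sub-bands along a leading channel axis are all assumptions made for this example. It shows how one parameter-free transform mixes each 2x2 pixel neighbourhood while halving the spatial resolution, and that the transform is invertible.

```python
import numpy as np

def haar_dwt2(x):
    """Single-level 2D Haar DWT of a 2D array with even height and width.

    Returns the approximation sub-band followed by three detail sub-bands,
    each of shape (H/2, W/2). Illustrative helper, not from the paper.
    """
    x = np.asarray(x, dtype=np.float64)
    h, w = x.shape
    if h % 2 or w % 2:
        raise ValueError("height and width must be even")
    # The four corners of every non-overlapping 2x2 block.
    a = x[0::2, 0::2]  # top-left
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    approx = (a + b + c + d) / 2.0     # local average (low-pass in both axes)
    detail_1 = (a + b - c - d) / 2.0   # difference between the two rows
    detail_2 = (a - b + c - d) / 2.0   # difference between the two columns
    detail_3 = (a - b - c + d) / 2.0   # diagonal difference
    return approx, detail_1, detail_2, detail_3

def haar_idwt2(approx, detail_1, detail_2, detail_3):
    """Invert haar_dwt2 and return the original (H, W) array."""
    h2, w2 = approx.shape
    x = np.empty((2 * h2, 2 * w2), dtype=np.float64)
    x[0::2, 0::2] = (approx + detail_1 + detail_2 + detail_3) / 2.0
    x[0::2, 1::2] = (approx + detail_1 - detail_2 - detail_3) / 2.0
    x[1::2, 0::2] = (approx - detail_1 + detail_2 - detail_3) / 2.0
    x[1::2, 1::2] = (approx - detail_1 - detail_2 + detail_3) / 2.0
    return x

def dwt_token_mix(x):
    """Stack the four sub-bands as channels: (H, W) -> (4, H/2, W/2).

    Assumption for illustration: a following convolution would then see
    all four sub-bands at each half-resolution location.
    """
    return np.stack(haar_dwt2(x), axis=0)
```

In this sketch the spatial mixing itself has no learnable weights; any learned processing would come from the convolutional layers applied to the stacked sub-bands. Applying the transform again to the approximation sub-band would give the next, coarser resolution level.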

research
05/03/2022

Comparison of CoModGANs, LaMa and GLIDE for Art Inpainting- Completing M.C Escher's Print Gallery

Digital art restoration has benefited from inpainting models to correct ...
research
07/01/2023

WaveMixSR: A Resource-efficient Neural Network for Image Super-resolution

Image super-resolution research has recently been dominated by transformer m...
research
05/14/2013

Novel variational model for inpainting in the wavelet domain

Wavelet domain inpainting refers to the process of recovering the missin...
research
04/15/2021

Spectrogram Inpainting for Interactive Generation of Instrument Sounds

Modern approaches to sound synthesis using deep neural networks are hard...
research
03/07/2022

WaveMix: Resource-efficient Token Mixing for Images

Although certain vision transformer (ViT) and CNN architectures generali...
research
05/28/2022

WaveMix-Lite: A Resource-efficient Neural Network for Image Analysis

Gains in the ability to generalize on image analysis tasks for neural ne...
research
11/02/2020

Image Inpainting with Learnable Feature Imputation

A regular convolution layer applying a filter in the same way over known...
