DDColor: Towards Photo-Realistic and Semantic-Aware Image Colorization via Dual Decoders

12/22/2022
by Xiaoyang Kang, et al.

Automatic image colorization is a particularly challenging problem. Because the problem is highly ill-posed and subject to multi-modal uncertainty, directly training a deep neural network usually leads to incorrect semantic colors and low color richness. Existing transformer-based methods can deliver better results but depend heavily on hand-crafted, dataset-level empirical distribution priors. In this work, we propose DDColor, a new end-to-end method with dual decoders for image colorization. More specifically, we design a multi-scale image decoder and a transformer-based color decoder. The former restores the spatial resolution of the image, while the latter establishes the correlation between semantic representations and color queries via cross-attention. The two decoders work together to learn semantic-aware color embeddings by leveraging multi-scale visual features. With the help of these two decoders, our method produces semantically consistent and visually plausible colorization results without any additional priors. In addition, a simple but effective colorfulness loss is introduced to further improve the color richness of the generated results. Our extensive experiments demonstrate that the proposed DDColor significantly outperforms existing state-of-the-art methods both quantitatively and qualitatively. Code will be made publicly available at https://github.com/piddnad/DDColor.
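
To make the cross-attention step between color queries and semantic representations concrete, here is a minimal NumPy sketch of generic single-head scaled dot-product cross-attention. It is an illustration only, not the DDColor implementation: the function name `cross_attention`, the projection matrices, and the sizes `K`, `N`, and `D` are all hypothetical, and the abstract does not specify the number of heads, layers, or scales actually used.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row maximum for numerical stability.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(queries, features, w_q, w_k, w_v):
    """Generic single-head cross-attention (illustrative, not the paper's code).

    queries:  (K, D) array of color queries (assumed learnable embeddings).
    features: (N, D) array of flattened image features (N = H * W).
    Returns the updated queries (K, D) and the attention weights (K, N).
    """
    q = queries @ w_q          # project queries
    k = features @ w_k         # project features to keys
    v = features @ w_v         # project features to values
    scores = q @ k.T / np.sqrt(q.shape[-1])
    weights = softmax(scores, axis=-1)  # each query attends over all positions
    return weights @ v, weights

# Hypothetical sizes chosen only for the demonstration.
rng = np.random.default_rng(0)
K, N, D = 4, 16, 8
color_queries = rng.normal(size=(K, D))
image_features = rng.normal(size=(N, D))
w_q, w_k, w_v = (rng.normal(size=(D, D)) / np.sqrt(D) for _ in range(3))

updated, weights = cross_attention(color_queries, image_features, w_q, w_k, w_v)
print(updated.shape, weights.shape)
```

Each row of `weights` sums to 1, so every color query becomes a weighted average of the projected image features. That is the sense in which a query is correlated with semantic content at particular spatial positions.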


