L-CAD: Language-based Colorization with Any-level Descriptions

05/24/2023
by Zheng Chang, et al.

Language-based colorization produces plausible and visually pleasing colors under the guidance of user-friendly natural language descriptions. Previous methods implicitly assume that users provide comprehensive color descriptions for most of the objects in the image, which leads to suboptimal performance when that assumption does not hold. In this paper, we propose a unified model to perform language-based colorization with any-level descriptions. We leverage a pretrained cross-modality generative model for its robust language understanding and rich color priors to handle the inherent ambiguity of any-level descriptions. We further design modules that align with the input conditions to preserve local spatial structures and prevent the ghosting effect. With the proposed sampling strategy, our model achieves instance-aware colorization in diverse and complex scenarios. Extensive experimental results demonstrate that our model handles any-level descriptions effectively and outperforms both language-based and automatic colorization methods. The code and pretrained models are available at: https://github.com/changzheng123/L-CAD.
