Eliminating Gradient Conflict in Reference-based Line-art Colorization

07/13/2022
by   Zekun Li, et al.
11

Reference-based line-art colorization is a challenging task in computer vision. The color, texture, and shading are rendered based on an abstract sketch, which heavily relies on the precise long-range dependency modeling between the sketch and reference. Popular techniques to bridge the cross-modal information and model the long-range dependency employ the attention mechanism. However, in the context of reference-based line-art colorization, several techniques would intensify the existing training difficulty of attention, for instance, self-supervised training protocol and GAN-based losses. To understand the instability in training, we detect the gradient flow of attention and observe gradient conflict among attention branches. This phenomenon motivates us to alleviate the gradient issue by preserving the dominant gradient branch while removing the conflict ones. We propose a novel attention mechanism using this training strategy, Stop-Gradient Attention (SGA), outperforming the attention baseline by a large margin with better training stability. Compared with state-of-the-art modules in line-art colorization, our approach demonstrates significant improvements in Fréchet Inception Distance (FID, up to 27.21 several benchmarks. The code of SGA is available at https://github.com/kunkun0w0/SGA .

READ FULL TEXT
research
12/21/2022

Attention-Aware Anime Line Drawing Colorization

Automatic colorization of anime line drawing has attracted much attentio...
research
10/07/2022

Pose Guided Human Image Synthesis with Partially Decoupled GAN

Pose Guided Human Image Synthesis (PGHIS) is a challenging task of trans...
research
07/09/2021

Cross-modal Attention for MRI and Ultrasound Volume Registration

Prostate cancer biopsy benefits from accurate fusion of transrectal ultr...
research
01/21/2022

Representing Long-Range Context for Graph Neural Networks with Global Attention

Graph neural networks are powerful architectures for structured datasets...
research
03/20/2023

AnimeDiffusion: Anime Face Line Drawing Colorization via Diffusion Models

It is a time-consuming and tedious work for manually colorizing anime li...
research
03/24/2020

Dynamic Hierarchical Mimicking Towards Consistent Optimization Objectives

While the depth of modern Convolutional Neural Networks (CNNs) surpasses...

Please sign up or login with your details

Forgot password? Click here to reset