Colorization Transformer

02/08/2021
by   Manoj Kumar, et al.
9

We present the Colorization Transformer, a novel approach for diverse high fidelity image colorization based on self-attention. Given a grayscale image, the colorization proceeds in three steps. We first use a conditional autoregressive transformer to produce a low resolution coarse coloring of the grayscale image. Our architecture adopts conditional transformer layers to effectively condition grayscale input. Two subsequent fully parallel networks upsample the coarse colored low resolution image into a finely colored high resolution image. Sampling from the Colorization Transformer produces diverse colorings whose fidelity outperforms the previous state-of-the-art on colorising ImageNet based on FID results and based on a human evaluation in a Mechanical Turk test. Remarkably, in more than 60 prefer the highest rated among three generated colorings over the ground truth. The code and pre-trained checkpoints for Colorization Transformer are publicly available at https://github.com/google-research/google-research/tree/master/coltran

READ FULL TEXT

page 16

page 17

page 18

page 20

page 21

page 22

page 23

page 24

research
07/23/2022

High-Resolution Swin Transformer for Automatic Medical Image Segmentation

The Resolution of feature maps is critical for medical image segmentatio...
research
05/19/2017

PixColor: Pixel Recursive Colorization

We propose a novel approach to automatically produce multiple colorized ...
research
03/06/2022

PanFormer: a Transformer Based Model for Pan-sharpening

Pan-sharpening aims at producing a high-resolution (HR) multi-spectral (...
research
07/05/2022

Swin Deformable Attention U-Net Transformer (SDAUT) for Explainable Fast MRI

Fast MRI aims to reconstruct a high fidelity image from partially observ...
research
05/30/2023

Bigger, Better, Faster: Human-level Atari with human-level efficiency

We introduce a value-based RL agent, which we call BBF, that achieves su...
research
06/13/2021

Styleformer: Transformer based Generative Adversarial Networks with Style Vector

We propose Styleformer, which is a style-based generator for GAN archite...
research
05/27/2023

Super-Resolution of License Plate Images Using Attention Modules and Sub-Pixel Convolution Layers

Recent years have seen significant developments in the field of License ...

Please sign up or login with your details

Forgot password? Click here to reset