Is It a Plausible Colour? UCapsNet for Image Colourisation

12/04/2020
by   Rita Pucci, et al.
10

Human beings can imagine the colours of a grayscale image with no particular effort thanks to their ability of semantic feature extraction. Can an autonomous system achieve that? Can it hallucinate plausible and vibrant colours? This is the colourisation problem. Different from existing works relying on convolutional neural network models pre-trained with supervision, we cast such colourisation problem as a self-supervised learning task. We tackle the problem with the introduction of a novel architecture based on Capsules trained following the adversarial learning paradigm. Capsule networks are able to extract a semantic representation of the entities in the image but loose details about their spatial information, which is important for colourising a grayscale image. Thus our UCapsNet structure comes with an encoding phase that extracts entities through capsules and spatial details through convolutional neural networks. A decoding phase merges the entity features with the spatial features to hallucinate a plausible colour version of the input datum. Results on the ImageNet benchmark show that our approach is able to generate more vibrant and plausible colours than exiting solutions and achieves superior performance than models pre-trained with supervision.

READ FULL TEXT
research
05/20/2019

Enriching Pre-trained Language Model with Entity Information for Relation Classification

Relation classification is an important NLP task to extract relations be...
research
01/21/2021

Pre-training without Natural Images

Is it possible to use convolutional neural networks pre-trained without ...
research
04/04/2018

Self-supervised Learning of Geometrically Stable Features Through Probabilistic Introspection

Self-supervision can dramatically cut back the amount of manually-labell...
research
02/23/2021

Comparative evaluation of CNN architectures for Image Caption Generation

Aided by recent advances in Deep Learning, Image Caption Generation has ...
research
01/19/2021

Collaboration among Image and Object Level Features for Image Colourisation

Image colourisation is an ill-posed problem, with multiple correct solut...
research
10/29/2020

Speech-Image Semantic Alignment Does Not Depend on Any Prior Classification Tasks

Semantically-aligned (speech, image) datasets can be used to explore "vi...
research
04/10/2017

Weakly-Supervised Spatial Context Networks

We explore the power of spatial context as a self-supervisory signal for...

Please sign up or login with your details

Forgot password? Click here to reset