Human vs Objective Evaluation of Colourisation Performance

04/11/2022
by Sean Mullery, et al.

Automatic colourisation of grey-scale images is the process of creating a full-colour image from the grey-scale prior. It is an ill-posed problem, as there are many plausible colourisations for a given grey-scale prior. The current state of the art in auto-colourisation involves image-to-image deep convolutional neural networks, with generative adversarial networks showing the greatest promise. The end goal of colourisation is to produce full-colour images that appear plausible to the human viewer, but human assessment is costly and time-consuming. This work assesses how well commonly used objective measures correlate with human opinion. We also attempt to determine which facets of colourisation have the most significant effect on human opinion. For each of 20 images from the BSD dataset, we create 65 recolourisations made up of local and global changes. Opinion scores are then crowdsourced using Amazon Mechanical Turk and, together with the images, form an extensible dataset called the Human Evaluated Colourisation Dataset (HECD). While we find statistically significant correlations between human-opinion scores and a small number of objective measures, the strength of the correlations is low. There is also evidence that human observers are most intolerant of an incorrect hue in naturally occurring objects.
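To make the comparison between an objective measure and human opinion concrete, the following is a minimal sketch of one way it could be set up. It is not the authors' pipeline: the abstract does not name the objective measures or the correlation statistic used, so PSNR and Spearman rank correlation are assumptions chosen for illustration, and the pixel values and opinion scores are invented toy data.

```python
import math

def psnr(reference, candidate, peak=255.0):
    """Peak signal-to-noise ratio between two equally sized flat pixel lists."""
    mse = sum((r - c) ** 2 for r, c in zip(reference, candidate)) / len(reference)
    return math.inf if mse == 0 else 10.0 * math.log10(peak ** 2 / mse)

def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)

def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))

# Toy data (hypothetical): one reference and four recolourisations, each
# flattened to four pixel values, plus an invented mean opinion score for each.
reference = [10, 50, 90, 130]
recolourisations = [
    [12, 49, 91, 128],
    [30, 70, 60, 100],
    [10, 52, 90, 131],
    [90, 10, 130, 50],
]
opinion_scores = [4.1, 2.5, 3.9, 1.2]

psnr_scores = [psnr(reference, image) for image in recolourisations]
rho = spearman(psnr_scores, opinion_scores)
print(f"Spearman correlation between PSNR and opinion: {rho:.2f}")
```

A rank-based statistic is used in this sketch because opinion scores are ordinal and need not relate linearly to a pixel-level measure. A real study would also need a significance test and far more than four samples.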

