Predicting Aesthetic Score Distribution through Cumulative Jensen-Shannon Divergence

08/23/2017
by   Xin Jin, et al.
0

Aesthetic quality prediction is a challenging task in the computer vision community because of the complex interplay with semantic contents and photographic technologies. Recent studies on the powerful deep learning based aesthetic quality assessment usually use a binary high-low label or a numerical score to represent the aesthetic quality. However the scalar representation cannot describe well the underlying varieties of the human perception of aesthetics. In this work, we propose to predict the aesthetic score distribution (i.e., a score distribution vector of the ordinal basic human ratings) using Deep Convolutional Neural Network (DCNN). Conventional DCNNs which aim to minimize the difference between the predicted scalar numbers or vectors and the ground truth cannot be directly used for the ordinal basic rating distribution. Thus, a novel CNN based on the Cumulative distribution with Jensen-Shannon divergence (CJS-CNN) is presented to predict the aesthetic score distribution of human ratings, with a new reliability-sensitive learning method based on the kurtosis of the score distribution, which eliminates the requirement of the original full data of human ratings (without normalization). Experimental results on large scale aesthetic dataset demonstrate the effectiveness of our introduced CJS-CNN in this task.

READ FULL TEXT
research
10/15/2020

A Deep Drift-Diffusion Model for Image Aesthetic Score Distribution Prediction

The task of aesthetic quality assessment is complicated due to its subje...
research
04/17/2019

MOSNet: Deep Learning based Objective Assessment for Voice Conversion

Existing objective evaluation metrics for voice conversion (VC) are not ...
research
11/02/2022

End-to-end deep multi-score model for No-reference stereoscopic image quality assessment

Deep learning-based quality metrics have recently given significant impr...
research
12/04/2022

Lightweight Facial Attractiveness Prediction Using Dual Label Distribution

Facial attractiveness prediction (FAP) aims to assess the facial attract...
research
01/16/2016

Brain-Inspired Deep Networks for Image Aesthetics Assessment

Image aesthetics assessment has been challenging due to its subjective n...
research
07/31/2020

A Pyramid Recurrent Network for Predicting Crowdsourced Speech-Quality Ratings of Real-World Signals

The real-world capabilities of objective speech quality measures are lim...
research
06/07/2021

Exploring to establish an appropriate model for mage aesthetic assessment via CNN-based RSRL: An empirical study

To establish an appropriate model for photo aesthetic assessment, in thi...

Please sign up or login with your details

Forgot password? Click here to reset