A Geometry-Sensitive Approach for Photographic Style Classification

09/03/2019
by Koustav Ghosal, et al.

Photographs are characterized by compositional attributes such as the Rule of Thirds, depth of field, and vanishing lines. The presence or absence of one or more of these attributes contributes to the overall artistic value of an image. In this work, we analyze the ability of deep-learning-based methods to learn such photographic style attributes. We observe that although a standard CNN learns texture- and appearance-based features reasonably well, its understanding of global and geometric features is limited by two factors. First, data-augmentation strategies (cropping, warping, etc.) distort the composition of a photograph and degrade performance. Second, CNN features are, in principle, translation-invariant and appearance-dependent, whereas some geometric properties important for aesthetics, e.g. the Rule of Thirds (RoT), are position-dependent and appearance-invariant. We therefore propose a novel input representation that is geometry-sensitive, position-cognizant, and appearance-invariant. We further introduce a two-column CNN architecture that performs better than the state-of-the-art (SoA) in photographic style classification. Our results show that the proposed network learns both the geometric and the appearance-based attributes better than the SoA.
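The claim that cropping distorts composition can be illustrated with a small sketch. It is not the authors' method or input representation. It only measures how far a single subject point lies from the nearest Rule-of-Thirds intersection, before and after a crop. The function names (`thirds_points`, `rot_distance`, `crop_point`), the normalization by the frame diagonal, and the example coordinates are all assumptions made for illustration.

```python
import math


def thirds_points(width, height):
    """Return the four Rule-of-Thirds intersections of a width x height frame."""
    return [(width * i / 3, height * j / 3) for i in (1, 2) for j in (1, 2)]


def rot_distance(point, width, height):
    """Distance from `point` to the nearest thirds intersection,
    normalized by the frame diagonal (an illustrative choice)."""
    px, py = point
    diagonal = math.hypot(width, height)
    nearest = min(
        math.hypot(px - tx, py - ty) for tx, ty in thirds_points(width, height)
    )
    return nearest / diagonal


def crop_point(point, box):
    """Map `point` into the frame of a crop box (left, top, right, bottom).
    Returns the shifted point and the crop's width and height."""
    left, top, right, bottom = box
    return (point[0] - left, point[1] - top), right - left, bottom - top


# Hypothetical example: a 600 x 300 photo with the subject placed
# exactly on the upper-left thirds intersection.
subject = (200, 100)
before = rot_distance(subject, 600, 300)

# A center crop, as commonly used in data augmentation.
cropped_subject, crop_w, crop_h = crop_point(subject, (150, 50, 450, 250))
after = rot_distance(cropped_subject, crop_w, crop_h)

print(f"distance before crop: {before:.3f}")
print(f"distance after crop:  {after:.3f}")
```

In this toy case the subject sits exactly on a thirds intersection in the original frame and moves off it after the crop, even though its appearance is unchanged. That is the sense in which the attribute is position-dependent and appearance-invariant.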


