SAVOIAS: A Diverse, Multi-Category Visual Complexity Dataset

10/03/2018
by   Elham Saraee, et al.
4

Visual complexity identifies the level of intricacy and details in an image or the level of difficulty to describe the image. It is an important concept in a variety of areas such as cognitive psychology, computer vision and visualization, and advertisement. Yet, efforts to create large, downloadable image datasets with diverse content and unbiased groundtruthing are lacking. In this work, we introduce Savoias, a visual complexity dataset that compromises of more than 1,400 images from seven image categories relevant to the above research areas, namely Scenes, Advertisements, Visualization and infographics, Objects, Interior design, Art, and Suprematism. The images in each category portray diverse characteristics including various low-level and high-level features, objects, backgrounds, textures and patterns, text, and graphics. The ground truth for Savoias is obtained by crowdsourcing more than 37,000 pairwise comparisons of images using the forced-choice methodology and with more than 1,600 contributors. The resulting relative scores are then converted to absolute visual complexity scores using the Bradley-Terry method and matrix completion. When applying five state-of-the-art algorithms to analyze the visual complexity of the images in the Savoias dataset, we found that the scores obtained from these baseline tools only correlate well with crowdsourced labels for abstract patterns in the Suprematism category (Pearson correlation r=0.84). For the other categories, in particular, the objects and advertisement categories, low correlation coefficients were revealed (r=0.3 and 0.56, respectively). These findings suggest that (1) state-of-the-art approaches are mostly insufficient and (2) Savoias enables category-specific method development, which is likely to improve the impact of visual complexity analysis on specific application areas, including computer vision.

READ FULL TEXT

page 1

page 5

page 6

research
12/20/2022

HouseCat6D – A Large-Scale Multi-Modal Category Level 6D Object Pose Dataset with Household Objects in Realistic Scenarios

Estimating the 6D pose of objects is one of the major fields in 3D compu...
research
10/12/2021

AVoE: A Synthetic 3D Dataset on Understanding Violation of Expectation for Artificial Cognition

Recent work in cognitive reasoning and computer vision has engendered an...
research
04/19/2022

A Tour of Visualization Techniques for Computer Vision Datasets

We survey a number of data visualization techniques for analyzing Comput...
research
03/08/2015

Understanding Image Virality

Virality of online content on social networking websites is an important...
research
02/03/2021

One Label, One Billion Faces: Usage and Consistency of Racial Categories in Computer Vision

Computer vision is widely deployed, has highly visible, society altering...
research
10/09/2018

Understanding and Predicting the Memorability of Natural Scene Images

Memorability measures how easily an image is to be memorized after glanc...

Please sign up or login with your details

Forgot password? Click here to reset