Totally Looks Like - How Humans Compare, Compared to Machines

03/05/2018
by Amir Rosenfeld, et al.

Perceptual judgment of image similarity by humans relies on rich internal representations ranging from low-level features to high-level concepts, scene properties and even cultural associations. Existing methods and datasets that attempt to explain perceived similarity, even those designed for this goal, use stimuli that arguably do not cover the full breadth of factors affecting human similarity judgments. We introduce a new dataset dubbed Totally-Looks-Like (TLL), after a popular entertainment website, which contains images paired by humans as being visually similar. The dataset contains 6016 image pairs from the wild, shedding light on a rich and diverse set of criteria employed by human beings. We conduct experiments that try to reproduce the pairings via features extracted from state-of-the-art deep convolutional neural networks, as well as additional human experiments to verify the consistency of the collected data. Even under conditions that artificially make the matching task progressively easier, we show that machine-extracted representations perform very poorly at reproducing the matches selected by humans. We discuss and analyze these results, suggesting future directions for improving learned image representations.
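
To make the matching experiment concrete, the following is a minimal sketch of how human pairings might be scored against machine features. It is not the authors' code. The abstract does not state which similarity measure or evaluation metric was used, so cosine similarity and top-1 accuracy are assumptions here, as are the function names and the toy three-dimensional vectors standing in for CNN features.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # treat an all-zero vector as dissimilar to everything
    return dot / (norm_u * norm_v)


def top1_matching_accuracy(left_feats, right_feats):
    """Fraction of left images whose most similar right image is the
    one humans paired it with (assumed to share the same index)."""
    hits = 0
    for i, query in enumerate(left_feats):
        best = max(range(len(right_feats)),
                   key=lambda j: cosine(query, right_feats[j]))
        if best == i:
            hits += 1
    return hits / len(left_feats)


# Toy stand-ins for CNN features (hypothetical values, not real data).
left = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
right = [[0.9, 0.1, 0.0], [0.0, 0.2, 0.9], [0.1, 0.3, 0.8]]

accuracy = top1_matching_accuracy(left, right)
print(f"top-1 matching accuracy: {accuracy:.3f}")
```

In this toy case only the first pair is recovered: the second and third left-hand vectors are each closer to the other's partner. This is the kind of disagreement between feature similarity and human pairing that the abstract reports. Restricting `right_feats` to a smaller candidate set would be one way to make the task easier, though the paper's exact procedure is not described here.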


research
08/06/2016

Adapting Deep Network Features to Capture Psychological Representations

Deep neural networks have become increasingly successful at solving clas...
research
07/12/2022

Twin identification over viewpoint change: A deep convolutional neural network surpasses humans

Deep convolutional neural networks (DCNNs) have achieved human-level acc...
research
03/08/2015

Understanding Image Virality

Virality of online content on social networking websites is an important...
research
07/28/2016

Gated Siamese Convolutional Neural Network Architecture for Human Re-Identification

Matching pedestrians across multiple camera views, known as human re-ide...
research
07/25/2023

Do humans and Convolutional Neural Networks attend to similar areas during scene classification: Effects of task and image type

Deep Learning models like Convolutional Neural Networks (CNN) are powerf...
research
04/06/2018

Cross-Domain Image Matching with Deep Feature Maps

We investigate the problem of automatically determining what type of sho...
research
02/14/2018

Similarity measures for vocal-based drum sample retrieval using deep convolutional auto-encoders

The expressive nature of the voice provides a powerful medium for commun...
