Compare and Contrast: Learning Prominent Visual Differences

03/31/2018
by   Steven Chen, et al.
0

Relative attribute models can compare images in terms of all detected properties or attributes, exhaustively predicting which image is fancier, more natural, and so on without any regard to ordering. However, when humans compare images, certain differences will naturally stick out and come to mind first. These most noticeable differences, or prominent differences, are likely to be described first. In addition, many differences, although present, may not be mentioned at all. In this work, we introduce and model prominent differences, a rich new functionality for comparing images. We collect instance-level annotations of most noticeable differences, and build a model trained on relative attribute features that predicts prominent differences for unseen pairs. We test our model on the challenging UT-Zap50K shoes and LFW10 faces datasets, and outperform an array of baseline methods. We then demonstrate how our prominence model improves two vision tasks, image search and description generation, enabling more natural communication between people and vision systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/19/2016

Semantic Jitter: Dense Supervision for Visual Comparisons via Synthetic Images

Distinguishing subtle differences in attributes is valuable, yet learnin...
research
03/08/2015

Understanding Image Virality

Virality of online content on social networking websites is an important...
research
02/03/2021

L2C: Describing Visual Differences Needs Semantic Understanding of Individuals

Recent advances in language and vision push forward the research of capt...
research
08/31/2018

Learning to Describe Differences Between Pairs of Similar Images

In this paper, we introduce the task of automatically generating text to...
research
01/08/2019

Thinking Outside the Pool: Active Training Image Creation for Relative Attributes

Current wisdom suggests more labeled image data is always better, and ob...
research
04/26/2023

HDR-VDP-3: A multi-metric for predicting image differences, quality and contrast distortions in high dynamic range and regular content

High-Dynamic-Range Visual-Difference-Predictor version 3, or HDR-VDP-3, ...
research
09/24/2018

Give me a hint! Navigating Image Databases using Human-in-the-loop Feedback

In this paper, we introduce an attribute-based interactive image search ...

Please sign up or login with your details

Forgot password? Click here to reset