How hard can it be? Estimating the difficulty of visual search in an image

05/23/2017
by   Radu Tudor Ionescu, et al.
0

We address the problem of estimating image difficulty defined as the human response time for solving a visual search task. We collect human annotations of image difficulty for the PASCAL VOC 2012 data set through a crowd-sourcing platform. We then analyze what human interpretable image properties can have an impact on visual search difficulty, and how accurate are those properties for predicting difficulty. Next, we build a regression model based on deep features learned with state of the art convolutional neural networks and show better results for predicting the ground-truth visual search difficulty scores produced by human annotators. Our model is able to correctly rank about 75 image pairs according to their difficulty score. We also show that our difficulty predictor generalizes well to new classes not seen during training. Finally, we demonstrate that our predicted difficulty scores are useful for weakly supervised object localization (8 object classification (1

READ FULL TEXT

page 2

page 5

research
03/22/2022

Was that so hard? Estimating human classification difficulty

When doctors are trained to diagnose a specific disease, they learn fast...
research
03/26/2018

Efficient Image Dataset Classification Difficulty Estimation for Predicting Deep-Learning Accuracy

In the deep-learning community new algorithms are published at an incred...
research
12/22/2017

Predicting Triple Scoring with Crowdsourcing-specific Features - The fiddlehead Triple Scorer at WSDM Cup 2017

The Triple Scoring Task at the WSDM Cup 2017 involves the prediction of ...
research
04/12/2020

Which visual questions are difficult to answer? Analysis with Entropy of Answer Distributions

We propose a novel approach to identify the difficulty of visual questio...
research
07/04/2016

Modeling of Item-Difficulty for Ontology-based MCQs

Multiple choice questions (MCQs) that can be generated from a domain ont...
research
06/14/2023

Combining piano performance dimensions for score difficulty classification

Predicting the difficulty of playing a musical score is essential for st...
research
07/19/2019

Predicting Visual Memory Schemas with Variational Autoencoders

Visual memory schema (VMS) maps show which regions of an image cause tha...

Please sign up or login with your details

Forgot password? Click here to reset