Is Bottom-Up Attention Useful for Scene Recognition?

07/22/2013
by   Samuel F. Dodge, et al.
0

The human visual system employs a selective attention mechanism to understand the visual world in an eficient manner. In this paper, we show how computational models of this mechanism can be exploited for the computer vision application of scene recognition. First, we consider saliency weighting and saliency pruning, and provide a comparison of the performance of different attention models in these approaches in terms of classification accuracy. Pruning can achieve a high degree of computational savings without significantly sacrificing classification accuracy. In saliency weighting, however, we found that classification performance does not improve. In addition, we present a new method to incorporate salient and non-salient regions for improved classification accuracy. We treat the salient and non-salient regions separately and combine them using Multiple Kernel Learning. We evaluate our approach using the UIUC sports dataset and find that with a small training size, our method improves upon the classification accuracy of the baseline bag of features approach.

READ FULL TEXT

page 3

page 4

research
07/22/2013

A study of parameters affecting visual saliency assessment

Since the early 2000s, computational visual saliency has been a very act...
research
11/18/2014

Unsupervised Neural Architecture for Saliency Detection: Extended Version

We propose a novel neural network architecture for visual saliency detec...
research
03/31/2019

Multi-vision Attention Networks for On-line Red Jujube Grading

To solve the red jujube classification problem, this paper designs a con...
research
05/29/2019

Recurrent Existence Determination Through Policy Optimization

Binary determination of the presence of objects is one of the problems w...
research
12/01/2021

CYBORG: Blending Human Saliency Into the Loss Improves Deep Learning

Can deep learning models achieve greater generalization if their trainin...
research
07/23/2023

EnTri: Ensemble Learning with Tri-level Representations for Explainable Scene Recognition

Scene recognition based on deep-learning has made significant progress, ...
research
05/03/2021

MemX: An Attention-Aware Smart Eyewear System for Personalized Moment Auto-capture

This work presents MemX: a biologically-inspired attention-aware eyewear...

Please sign up or login with your details

Forgot password? Click here to reset