Visual Saliency Prediction Using a Mixture of Deep Neural Networks

02/01/2017
by Samuel Dodge, et al.

Visual saliency models have recently begun to incorporate deep learning to achieve predictive capacity much greater than previous unsupervised methods. However, most existing models predict saliency using local mechanisms limited to the receptive field of the network. We propose a model that incorporates global scene semantic information in addition to local information gathered by a convolutional neural network. Our model is formulated as a mixture of experts. Each expert network is trained to predict saliency for a set of closely related images. The final saliency map is computed as a weighted mixture of the expert networks' output, with weights determined by a separate gating network. This gating network is guided by global scene information to predict weights. The expert networks and the gating network are trained simultaneously in an end-to-end manner. We show that our mixture formulation leads to improvement in performance over an otherwise identical non-mixture model that does not incorporate global scene information.
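
As a rough illustration of the mixture step described above, the sketch below combines the outputs of several experts into one saliency map using weights from a gate. It is not the authors' implementation. The function names (`softmax`, `mix_saliency`), the array shapes, and the use of a softmax to normalize the gate are assumptions made for illustration. Random arrays stand in for the expert networks' outputs, and fixed numbers stand in for the gating network's scores.

```python
import numpy as np


def softmax(z):
    """Numerically stable softmax over a 1-D array of scores."""
    z = z - np.max(z)
    e = np.exp(z)
    return e / e.sum()


def mix_saliency(expert_maps, gate_logits):
    """Combine K expert saliency maps into one map.

    expert_maps: array of shape (K, H, W), one map per expert (hypothetical layout).
    gate_logits: array of shape (K,), unnormalized gate scores (hypothetical).
    Returns the mixed (H, W) map and the K mixture weights.
    """
    maps = np.asarray(expert_maps, dtype=float)
    # Assumption: gate scores are normalized with a softmax so the weights
    # are positive and sum to 1.
    weights = softmax(np.asarray(gate_logits, dtype=float))
    # Weighted sum over the expert axis: sum_k weights[k] * maps[k].
    mixed = np.tensordot(weights, maps, axes=1)
    return mixed, weights


# Placeholder data: 3 experts, each producing a 4x5 map.
rng = np.random.default_rng(0)
expert_maps = rng.random((3, 4, 5))
gate_logits = np.array([2.0, 0.5, -1.0])

saliency, weights = mix_saliency(expert_maps, gate_logits)
print(saliency.shape, weights)
```

In the model the abstract describes, the gate scores would be predicted from global scene information and trained jointly with the experts. Here they are constants so that the weighting itself is easy to inspect.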

research
03/23/2017

Quality Resilient Deep Neural Networks

We study deep neural networks for classification of images with quality ...
research
10/06/2016

A Deep Spatial Contextual Long-term Recurrent Convolutional Network for Saliency Detection

Traditional saliency models usually adopt hand-crafted image features an...
research
10/31/2018

A Mixture of Expert Approach for Low-Cost Customization of Deep Neural Networks

The ability to customize a trained Deep Neural Network (DNN) locally usi...
research
05/07/2017

Deep Visual Attention Prediction

Deep Convolutional Neural Networks (CNNs) have made substantial improvem...
research
03/02/2016

Shallow and Deep Convolutional Networks for Saliency Prediction

The prediction of salient areas in images has been traditionally address...
research
03/15/2023

MRGAN360: Multi-stage Recurrent Generative Adversarial Network for 360 Degree Image Saliency Prediction

Thanks to the ability of providing an immersive and interactive experien...
research
10/05/2016

DeepGaze II: Reading fixations from deep features trained on object recognition

Here we present DeepGaze II, a model that predicts where people look in ...
