A Deep Spatial Contextual Long-term Recurrent Convolutional Network for Saliency Detection

10/06/2016
by   Nian Liu, et al.
0

Traditional saliency models usually adopt hand-crafted image features and human-designed mechanisms to calculate local or global contrast. In this paper, we propose a novel computational saliency model, i.e., deep spatial contextual long-term recurrent convolutional network (DSCLRCN) to predict where people looks in natural scenes. DSCLRCN first automatically learns saliency related local features on each image location in parallel. Then, in contrast with most other deep network based saliency models which infer saliency in local contexts, DSCLRCN can mimic the cortical lateral inhibition mechanisms in human visual system to incorporate global contexts to assess the saliency of each image location by leveraging the deep spatial long short-term memory (DSLSTM) model. Moreover, we also integrate scene context modulation in DSLSTM for saliency inference, leading to a novel deep spatial contextual LSTM (DSCLSTM) model. The whole network can be trained end-to-end and works efficiently when testing. Experimental results on two benchmark datasets show that DSCLRCN can achieve state-of-the-art performance on saliency detection. Furthermore, the proposed DSCLSTM model can significantly boost the saliency detection performance by incorporating both global spatial interconnections and scene context modulation, which may uncover novel inspirations for studies on them in computational saliency models.

READ FULL TEXT

page 1

page 9

research
08/18/2016

Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection

This paper proposes a novel saliency detection method by developing a de...
research
11/09/2018

Semantic and Contrast-Aware Saliency

In this paper, we proposed an integrated model of semantic-aware and con...
research
02/01/2017

Visual Saliency Prediction Using a Mixture of Deep Neural Networks

Visual saliency models have recently begun to incorporate deep learning ...
research
04/12/2016

Recurrent Attentional Networks for Saliency Detection

Convolutional-deconvolution networks can be adopted to perform end-to-en...
research
07/06/2015

End-to-end Convolutional Network for Saliency Prediction

The prediction of saliency areas in images has been traditionally addres...
research
10/10/2015

DeepFix: A Fully Convolutional Neural Network for predicting Human Eye Fixations

Understanding and predicting the human visual attentional mechanism is a...
research
11/17/2016

Instance-aware Image and Sentence Matching with Selective Multimodal LSTM

Effective image and sentence matching depends on how to well measure the...

Please sign up or login with your details

Forgot password? Click here to reset