Learning Saliency Prediction From Sparse Fixation Pixel Map

09/03/2018
by   Shanghua Xiao, et al.
0

Ground truth for saliency prediction datasets consists of two types of map data: fixation pixel map which records the human eye movements on sample images, and fixation blob map generated by performing gaussian blurring on the corresponding fixation pixel map. Current saliency approaches perform prediction by directly pixel-wise regressing the input image into saliency map with fixation blob as ground truth, yet learning saliency from fixation pixel map is not explored. In this work, we propose a first-of-its-kind approach of learning saliency prediction from sparse fixation pixel map, and a novel loss function for training from such sparse fixation. We utilize clustering to extract sparse fixation pixel from the raw fixation pixel map, and add a max-pooling transformation on the output to avoid false penalty between sparse outputs and labels caused by nearby but non-overlapping saliency pixels when calculating loss. This approach provides a novel perspective for achieving saliency prediction. We evaluate our approach over multiple benchmark datasets, and achieve competitive performance in terms of multiple metrics comparing with state-of-the-art saliency methods.

READ FULL TEXT
research
01/06/2019

Unsupervised uncertainty estimation using spatiotemporal cues in video saliency detection

In this paper, we address the problem of quantifying reliability of comp...
research
07/06/2015

End-to-end Convolutional Network for Saliency Prediction

The prediction of saliency areas in images has been traditionally addres...
research
08/04/2016

Saliency Integration: An Arbitrator Model

Saliency integration approaches have aroused general concern on unifying...
research
07/28/2021

Evaluating the Use of Reconstruction Error for Novelty Localization

The pixelwise reconstruction error of deep autoencoders is often utilize...
research
06/22/2021

Confidence-Aware Learning for Camouflaged Object Detection

Confidence-aware learning is proven as an effective solution to prevent ...
research
02/07/2020

An Auxiliary Task for Learning Nuclei Segmentation in 3D Microscopy Images

Segmentation of cell nuclei in microscopy images is a prevalent necessit...
research
09/19/2017

SalNet360: Saliency Maps for omni-directional images with CNN

The prediction of Visual Attention data from any kind of media is of val...

Please sign up or login with your details

Forgot password? Click here to reset