SalNet360: Saliency Maps for omni-directional images with CNN

by Rafael Monroy, et al.

Predicting visual attention data from any kind of media is valuable to content creators and can be used to drive encoding algorithms efficiently. With the current trend in the Virtual Reality (VR) field, adapting known techniques to this new kind of media is starting to gain momentum. In this paper, we present an architectural extension to any Convolutional Neural Network (CNN) to fine-tune traditional 2D saliency prediction to Omnidirectional Images (ODIs) in an end-to-end manner. We show that each step in the proposed pipeline makes the generated saliency map more accurate with respect to ground truth data.



Saliency for free: Saliency prediction as a side-effect of object recognition

Saliency is the perceptual capacity of our visual system to focus our at...

Saliency Map Estimation for Omni-Directional Image Considering Prior Distributions

In recent years, the deep learning techniques have been applied to the e...

End-to-end Convolutional Network for Saliency Prediction

The prediction of saliency areas in images has been traditionally addres...

Shallow and Deep Convolutional Networks for Saliency Prediction

The prediction of salient areas in images has been traditionally address...

Learning Saliency Prediction From Sparse Fixation Pixel Map

Ground truth for saliency prediction datasets consists of two types of m...

Deep3D: Fully Automatic 2D-to-3D Video Conversion with Deep Convolutional Neural Networks

As 3D movie viewing becomes mainstream and Virtual Reality (VR) market e...

Spherical Convolution empowered FoV Prediction in 360-degree Video Multicast with Limited FoV Feedback

Field of view (FoV) prediction is critical in 360-degree video multicast...