Learning Gaze Transitions from Depth to Improve Video Saliency Estimation

03/11/2016
by G. Leifman, et al.

In this paper we introduce a novel Depth-Aware Video Saliency approach to predict the human focus of attention when viewing RGBD videos on regular 2D screens. We train a generative convolutional neural network that predicts a saliency map for a frame, given the fixation map of the previous frame. Saliency estimation in this scenario is highly important because, in the near future, 3D video content will be easy to acquire yet hard to display. On the one hand, 3D-capable acquisition equipment has improved dramatically. On the other hand, despite considerable progress in 3D display technologies, most 3D displays are still expensive and require wearing special glasses. To evaluate the performance of our approach, we present a new comprehensive database of eye-fixation ground truth for RGBD videos. Our experiments indicate that integrating depth into video saliency calculation is beneficial. We demonstrate that our approach outperforms state-of-the-art methods for video saliency, achieving a 15% improvement.

research
05/19/2023

ViDaS: Video Depth-aware Saliency Network

We introduce ViDaS, a two-stream, fully convolutional Video, Depth-Aware...
research
01/06/2019

Unsupervised uncertainty estimation using spatiotemporal cues in video saliency detection

In this paper, we address the problem of quantifying reliability of comp...
research
09/21/2023

Using Saliency and Cropping to Improve Video Memorability

Video memorability is a measure of how likely a particular video is to b...
research
05/15/2019

Synthetic Defocus and Look-Ahead Autofocus for Casual Videography

In cinema, large camera lenses create beautiful shallow depth of field (...
research
05/20/2019

Are all the frames equally important?

In this work, we address the problem of measuring and predicting tempora...
research
11/14/2018

How Drones Look: Crowdsourced Knowledge Transfer for Aerial Video Saliency Prediction

In ground-level platforms, many saliency models have been developed to p...
research
11/26/2019

Revisiting Deep Architectures for Head Motion Prediction in 360° Videos

Head motion prediction is an important problem with 360 videos, in parti...
