Deep Learning for Saliency Prediction in Natural Video

04/27/2016
by   Souad Chaabouni, et al.
0

The purpose of this paper is the detection of salient areas in natural video by using the new deep learning techniques. Salient patches in video frames are predicted first. Then the predicted visual fixation maps are built upon them. We design the deep architecture on the basis of CaffeNet implemented with Caffe toolkit. We show that changing the way of data selection for optimisation of network parameters, we can save computation cost up to 12 times. We extend deep learning approaches for saliency prediction in still images with RGB values to specificity of video using the sensitivity of the human visual system to residual motion. Furthermore, we complete primary colour pixel values by contrast features proposed in classical visual attention prediction models. The experiments are conducted on two publicly available datasets. The first is IRCCYN video database containing 31 videos with an overall amount of 7300 frames and eye fixations of 37 subjects. The second one is HOLLYWOOD2 provided 2517 movie clips with the eye fixations of 19 subjects. On IRCYYN dataset, the accuracy obtained is of 89.51 saliency of patches show the improvement up to 2 The resulting accuracy of 76, 6 predicted saliency maps with visual fixation maps shows the increase up to 16 on a sample of video clips from this dataset.

READ FULL TEXT

page 7

page 8

page 16

page 18

research
01/30/2019

Understanding spatial correlation in eye-fixation maps for visual attention in videos

In this paper, we present an analysis of recorded eye-fixation data from...
research
05/01/2020

A Naturalness Evaluation Database for Video Prediction Models

The study of video prediction models is believed to be a fundamental app...
research
03/13/2018

A Learning-Based Visual Saliency Prediction Model for Stereoscopic 3D Video (LBVS-3D)

Over the past decade, many computational saliency prediction models have...
research
11/20/2020

ATSal: An Attention Based Architecture for Saliency Prediction in 360 Videos

The spherical domain representation of 360 video/image presents many cha...
research
11/29/2017

Saccade Sequence Prediction: Beyond Static Saliency Maps

Visual attention is a field with a considerable history, with eye moveme...
research
07/27/2023

NSA: Naturalistic Support Artifact to Boost Network Confidence

Visual AI systems are vulnerable to natural and synthetic physical corru...
research
03/23/2017

Saliency-guided video classification via adaptively weighted learning

Video classification is productive in many practical applications, and t...

Please sign up or login with your details

Forgot password? Click here to reset