A Benchmark Dataset and Saliency-guided Stacked Autoencoders for Video-based Salient Object Detection

11/01/2016
by   Jia Li, et al.

Image-based salient object detection (SOD) has been extensively studied in the past decades. However, video-based SOD is much less explored because large-scale video datasets in which salient objects are unambiguously defined and annotated are lacking. Toward this end, this paper proposes a video-based SOD dataset that consists of 200 videos (64 minutes). In constructing the dataset, we manually annotate all objects and regions over 7,650 uniformly sampled keyframes and collect the eye-tracking data of 23 subjects who free-view all videos. From the user data, we find that salient objects in video can be defined as objects that consistently pop out throughout the video, and that objects with such attributes can be unambiguously annotated by combining manually annotated object/region masks with the eye-tracking data of multiple subjects. To the best of our knowledge, it is currently the largest dataset for video-based salient object detection. Based on this dataset, this paper proposes an unsupervised baseline approach for video-based SOD that uses saliency-guided stacked autoencoders. In the proposed approach, multiple spatiotemporal saliency cues are first extracted at the pixel, superpixel and object levels. With these saliency cues, stacked autoencoders are constructed in an unsupervised manner; they automatically infer a saliency score for each pixel by progressively encoding the high-dimensional saliency cues gathered from the pixel and its spatiotemporal neighbors. Experimental results show that the proposed unsupervised approach outperforms 30 state-of-the-art models on the proposed dataset, including 19 image-based & classic (unsupervised or non-deep learning), 6 image-based & deep learning, and 5 video-based & unsupervised. Moreover, benchmarking results show that the proposed dataset is very challenging and has the potential to boost the development of video-based SOD.
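The abstract gives no architectural details, so the following is only a minimal sketch of the general idea of progressively encoding per-pixel cue vectors down to a single score. Everything concrete here is an assumption made for illustration: the neighborhood (4 spatial and 2 temporal neighbors), the layer sizes (9, 3, 1), plain greedy layer-wise training with a squared-error loss, the synthetic cues, and the helper names `gather_neighbors`, `train_autoencoder` and `infer_saliency`. It is not the authors' implementation.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gather_neighbors(cues):
    """cues: (T, H, W, C) -> (T*H*W, 7*C).

    Concatenates each pixel's cues with those of its 4 spatial and
    2 temporal neighbors. np.roll wraps around at the borders, which
    is a simplification chosen for brevity.
    """
    shifts = [(0, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1),
              (0, 0, -1), (1, 0, 0), (-1, 0, 0)]
    stacked = [np.roll(cues, s, axis=(0, 1, 2)) for s in shifts]
    return np.concatenate(stacked, axis=-1).reshape(-1, 7 * cues.shape[-1])

def train_autoencoder(X, hidden_dim, epochs=200, lr=0.5, seed=0):
    """Train one sigmoid autoencoder layer with full-batch gradient descent.

    Returns the encoder weights, encoder bias, and per-epoch MSE losses.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W1 = rng.normal(0.0, 0.1, (d, hidden_dim))
    b1 = np.zeros(hidden_dim)
    W2 = rng.normal(0.0, 0.1, (hidden_dim, d))
    b2 = np.zeros(d)
    losses = []
    for _ in range(epochs):
        H = sigmoid(X @ W1 + b1)          # encode
        R = sigmoid(H @ W2 + b2)          # decode (reconstruction)
        err = R - X
        losses.append(float(np.mean(err ** 2)))
        # Backpropagate the squared error through both sigmoid layers.
        dR = err * R * (1.0 - R) / n
        dH = (dR @ W2.T) * H * (1.0 - H)
        W2 -= lr * (H.T @ dR)
        b2 -= lr * dR.sum(axis=0)
        W1 -= lr * (X.T @ dH)
        b1 -= lr * dH.sum(axis=0)
    return W1, b1, losses

def infer_saliency(cues, layer_dims=(9, 3, 1)):
    """Greedily stack autoencoders; the final 1-D code is the score."""
    X = gather_neighbors(cues)
    codes, all_losses = X, []
    for i, hidden_dim in enumerate(layer_dims):
        W, b, losses = train_autoencoder(codes, hidden_dim, seed=i)
        codes = sigmoid(codes @ W + b)    # input to the next layer
        all_losses.append(losses)
    scores = codes[:, 0]
    # The sign of an unsupervised code is arbitrary. As an assumed,
    # simple form of guidance, flip it so that it does not correlate
    # negatively with the mean input cue.
    if np.corrcoef(scores, X.mean(axis=1))[0, 1] < 0:
        scores = 1.0 - scores
    return scores.reshape(cues.shape[:3]), all_losses

# Synthetic example: 3 frames of 8x8 pixels with 3 noisy cues per pixel,
# where a central square has stronger cue responses.
rng = np.random.default_rng(0)
latent = np.zeros((3, 8, 8))
latent[:, 2:6, 2:6] = 1.0
cues = np.clip(0.8 * latent[..., None]
               + rng.normal(0.1, 0.1, (3, 8, 8, 3)), 0.0, 1.0)
saliency, losses = infer_saliency(cues)
print(saliency.shape)
```

In this sketch each layer is trained only to reconstruct its own input, so the per-pixel score emerges without any ground-truth masks; how the saliency cues actually guide training in the paper is described in the full text, not in the abstract.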


research
09/11/2019

Distortion-adaptive Salient Object Detection in 360° Omnidirectional Images

Image-based salient object detection (SOD) has been extensively explored...
research
10/11/2012

Unsupervised Detection and Tracking of Arbitrary Objects with Dependent Dirichlet Process Mixtures

This paper proposes a technique for the unsupervised detection and track...
research
04/05/2022

Learning Video Salient Object Detection Progressively from Unlabeled Videos

Recent deep learning-based video salient object detection (VSOD) has ach...
research
11/14/2018

How Drones Look: Crowdsourced Knowledge Transfer for Aerial Video Saliency Prediction

In ground-level platforms, many saliency models have been developed to p...
research
07/24/2021

ASOD60K: Audio-Induced Salient Object Detection in Panoramic Videos

Exploring to what humans pay attention in dynamic panoramic scenes is us...
research
09/27/2022

Video-based estimation of pain indicators in dogs

Dog owners are typically capable of recognizing behavioral cues that rev...
research
01/22/2020

A Fixation-based 360° Benchmark Dataset for Salient Object Detection

Fixation prediction (FP) in panoramic contents has been widely investiga...
