Appearance Fusion of Multiple Cues for Video Co-localization

This work addresses a problem named video co-localization that aims at localizing the objects in videos jointly. Although there are numerous cues available for this purpose, for example, saliency, motion, and joint, their robust fusion can be quite challenging at times due to their spatial inconsistencies. To overcome this, in this paper, we propose a novel appearance fusion method where we fuse appearance models derived from these cues rather than spatially fusing their maps. In this method, we evaluate the cues in terms of their reliability and consensus to guide the appearance fusion process. We also develop a novel joint cue relying on topological hierarchy. We utilize the final fusion results to produce a few candidate bounding boxes and for subsequent optimal selection among them while considering the spatiotemporal constraints. The proposed method achieves promising results on the YouTube Objects dataset.

READ FULL TEXT

page 1

page 3

page 6

page 7

page 8

research
07/04/2020

Jointly Modeling Motion and Appearance Cues for Robust RGB-T Tracking

In this study, we propose a novel RGB-T tracking framework by jointly mo...
research
11/26/2018

Foreground Clustering for Joint Segmentation and Localization in Videos and Images

This paper presents a novel framework in which video/image segmentation ...
research
04/04/2019

Spatiotemporal CNN for Video Object Segmentation

In this paper, we present a unified, end-to-end trainable spatiotemporal...
research
06/26/2017

YoTube: Searching Action Proposal via Recurrent and Static Regression Networks

In this paper, we present YoTube-a novel network fusion framework for se...
research
08/05/2019

SESF-Fuse: An Unsupervised Deep Model for Multi-Focus Image Fusion

In this work, we propose a novel unsupervised deep learning model to add...
research
06/23/2016

Find your Way by Observing the Sun and Other Semantic Cues

In this paper we present a robust, efficient and affordable approach to ...
research
07/25/2017

Motion-Appearance Interactive Encoding for Object Segmentation in Unconstrained Videos

We present a novel method of integrating motion and appearance cues for ...

Please sign up or login with your details

Forgot password? Click here to reset