TSANET: Temporal and Scale Alignment for Unsupervised Video Object Segmentation

03/08/2023
by   Seunghoon Lee, et al.
0

Unsupervised Video Object Segmentation (UVOS) refers to the challenging task of segmenting the prominent object in videos without manual guidance. In other words, the network detects the accurate region of the target object in a sequence of RGB frames without prior knowledge. In recent works, two approaches for UVOS have been discussed that can be divided into: appearance and appearance-motion based methods. Appearance based methods utilize the correlation information of inter-frames to capture target object that commonly appears in a sequence. However, these methods does not consider the motion of target object due to exploit the correlation information between randomly paired frames. Appearance-motion based methods, on the other hand, fuse the appearance features from RGB frames with the motion features from optical flow. Motion cue provides useful information since salient objects typically show distinctive motion in a sequence. However, these approaches have the limitation that the dependency on optical flow is dominant. In this paper, we propose a novel framework for UVOS that can address aforementioned limitations of two approaches in terms of both time and scale. Temporal Alignment Fusion aligns the saliency information of adjacent frames with the target frame to leverage the information of adjacent frames. Scale Alignment Decoder predicts the target object mask precisely by aggregating differently scaled feature maps via continuous mapping with implicit neural representation. We present experimental results on public benchmark datasets, DAVIS 2016 and FBMS, which demonstrate the effectiveness of our method. Furthermore, we outperform the state-of-the-art methods on DAVIS 2016.

READ FULL TEXT

page 2

page 4

research
07/18/2022

Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation

Optical flow is an easily conceived and precious cue for advancing unsup...
research
09/08/2022

Unsupervised Video Object Segmentation via Prototype Memory Network

Unsupervised video object segmentation aims to segment a target object i...
research
11/22/2022

Domain Alignment and Temporal Aggregation for Unsupervised Video Object Segmentation

Unsupervised video object segmentation aims at detecting and segmenting ...
research
05/11/2021

Video Frame Interpolation via Structure-Motion based Iterative Fusion

Video Frame Interpolation synthesizes non-existent images between adjace...
research
10/01/2022

Motion-inductive Self-supervised Object Discovery in Videos

In this paper, we consider the task of unsupervised object discovery in ...
research
09/29/2019

Exploiting Geometric Constraints on Dense Trajectories for Motion Saliency

The existing approaches for salient motion segmentation are unable to ex...
research
09/10/2021

Automatic Portrait Video Matting via Context Motion Network

Automatic portrait video matting is an under-constrained problem. Most s...

Please sign up or login with your details

Forgot password? Click here to reset