Video Object of Interest Segmentation

12/06/2022
by   Siyuan Zhou, et al.
0

In this work, we present a new computer vision task named video object of interest segmentation (VOIS). Given a video and a target image of interest, our objective is to simultaneously segment and track all objects in the video that are relevant to the target image. This problem combines the traditional video object segmentation task with an additional image indicating the content that users are concerned with. Since no existing dataset is perfectly suitable for this new task, we specifically construct a large-scale dataset called LiveVideos, which contains 2418 pairs of target images and live videos with instance-level annotations. In addition, we propose a transformer-based method for this task. We revisit Swin Transformer and design a dual-path structure to fuse video and image features. Then, a transformer decoder is employed to generate object proposals for segmentation and tracking from the fused features. Extensive experiments on LiveVideos dataset show the superiority of our proposed method.

READ FULL TEXT

page 1

page 6

page 11

page 12

page 13

research
05/12/2019

Video Instance Segmentation

In this paper we present a new computer vision task, named video instanc...
research
07/02/2019

Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation

Video object segmentation (VOS) aims at pixel-level object tracking give...
research
01/16/2021

VideoClick: Video Object Segmentation with a Single Click

Annotating videos with object segmentation masks typically involves a tw...
research
09/21/2023

PanoVOS:Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation

Panoramic videos contain richer spatial information and have attracted t...
research
05/04/2023

Tracking through Containers and Occluders in the Wild

Tracking objects with persistence in cluttered and dynamic environments ...
research
08/24/2019

Where Is My Mirror?

Mirrors are everywhere in our daily lives. Existing computer vision syst...
research
11/14/2021

Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks

Computer vision tasks can benefit from the estimation of the salient obj...

Please sign up or login with your details

Forgot password? Click here to reset