Segment Anything Meets Point Tracking

07/03/2023
by   Frano Rajic, et al.
0

The Segment Anything Model (SAM) has established itself as a powerful zero-shot image segmentation model, employing interactive prompts such as points to generate masks. This paper presents SAM-PT, a method extending SAM's capability to tracking and segmenting anything in dynamic videos. SAM-PT leverages robust and sparse point selection and propagation techniques for mask generation, demonstrating that a SAM-based segmentation tracker can yield strong zero-shot performance across popular video object segmentation benchmarks, including DAVIS, YouTube-VOS, and MOSE. Compared to traditional object-centric mask propagation strategies, we uniquely use point propagation to exploit local structure information that is agnostic to object semantics. We highlight the merits of point-based tracking through direct evaluation on the zero-shot open-world Unidentified Video Objects (UVO) benchmark. To further enhance our approach, we utilize K-Medoids clustering for point initialization and track both positive and negative points to clearly distinguish the target object. We also employ multiple mask decoding passes for mask refinement and devise a point re-initialization strategy to improve tracking accuracy. Our code integrates different point trackers and video segmentation benchmarks and will be released at https://github.com/SysCV/sam-pt.

READ FULL TEXT

page 1

page 4

page 5

page 10

page 11

research
08/11/2023

FoodSAM: Any Food Segmentation

In this paper, we explore the zero-shot capability of the Segment Anythi...
research
06/30/2023

Training-free Object Counting with Prompts

This paper tackles the problem of object counting in images. Existing ap...
research
05/04/2023

Personalize Segment Anything Model with One Shot

Driven by large-data pre-training, Segment Anything Model (SAM) has been...
research
05/26/2020

ALBA : Reinforcement Learning for Video Object Segmentation

We consider the challenging problem of zero-shot video object segmentati...
research
03/31/2023

Zero-shot Referring Image Segmentation with Global-Local Context Features

Referring image segmentation (RIS) aims to find a segmentation mask give...
research
08/25/2023

Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation

Tracking any given object(s) spatially and temporally is a common purpos...
research
08/26/2023

Zero-Shot Edge Detection with SCESAME: Spectral Clustering-based Ensemble for Segment Anything Model Estimation

This paper proposes a novel zero-shot edge detection with SCESAME, which...

Please sign up or login with your details

Forgot password? Click here to reset