Detect Any Shadow: Segment Anything for Video Shadow Detection

05/26/2023
by   Yonghui Wang, et al.
0

Segment anything model (SAM) has achieved great success in the field of natural image segmentation. Nevertheless, SAM tends to classify shadows as background, resulting in poor segmentation performance for shadow detection task. In this paper, we propose an simple but effective approach for fine tuning SAM to detect shadows. Additionally, we also combine it with long short-term attention mechanism to extend its capabilities to video shadow detection. Specifically, we first fine tune SAM by utilizing shadow data combined with sparse prompts and apply the fine-tuned model to detect a specific frame (e.g., first frame) in the video with a little user assistance. Subsequently, using the detected frame as a reference, we employ a long short-term network to learn spatial correlations between distant frames and temporal consistency between contiguous frames, thereby achieving shadow information propagation across frames. Extensive experimental results demonstrate that our method outperforms the state-of-the-art techniques, with improvements of 17.2 validating the effectiveness of our method.

READ FULL TEXT

page 1

page 3

page 4

page 7

page 8

research
09/02/2020

LSMVOS: Long-Short-Term Similarity Matching for Video Object

Objective Semi-supervised video object segmentation refers to segmenting...
research
06/25/2023

When SAM Meets Sonar Images

Segment Anything Model (SAM) has revolutionized the way of segmentation....
research
07/07/2021

Long Short-Term Transformer for Online Action Detection

In this paper, we present Long Short-term TRansformer (LSTR), a new temp...
research
07/16/2020

World-Consistent Video-to-Video Synthesis

Video-to-video synthesis (vid2vid) aims for converting high-level semant...
research
05/31/2018

Attention-Based LSTM for Psychological Stress Detection from Spoken Language Using Distant Supervision

We propose a Long Short-Term Memory (LSTM) with attention mechanism to c...
research
10/24/2019

Anchor Diffusion for Unsupervised Video Object Segmentation

Unsupervised video object segmentation has often been tackled by methods...
research
10/06/2017

Detecting the Moment of Completion: Temporal Models for Localising Action Completion

Action completion detection is the problem of modelling the action's pro...

Please sign up or login with your details

Forgot password? Click here to reset