CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection

12/09/2022
by   Hyekang Kevin Joo, et al.
0

Video anomaly detection (VAD) – commonly formulated as a multiple-instance learning problem in a weakly-supervised manner due to its labor-intensive nature – is a challenging problem in video surveillance where the frames of anomaly need to be localized in an untrimmed video. In this paper, we first propose to utilize the ViT-encoded visual features from CLIP, in contrast with the conventional C3D or I3D features in the domain, to efficiently extract discriminative representations in the novel technique. We then model long- and short-range temporal dependencies and nominate the snippets of interest by leveraging our proposed Temporal Self-Attention (TSA). The ablation study conducted on each component confirms its effectiveness in the problem, and the extensive experiments show that our proposed CLIP-TSA outperforms the existing state-of-the-art (SOTA) methods by a large margin on two commonly-used benchmark datasets in the VAD problem (UCF-Crime and ShanghaiTech Campus). The source code will be made publicly available upon acceptance.

READ FULL TEXT
research
01/25/2021

Weakly-supervised Video Anomaly Detection with Contrastive Learning of Long and Short-range Temporal Features

In this paper, we address the problem of weakly-supervised video anomaly...
research
06/03/2022

Anomaly detection in surveillance videos using transformer based attention model

Surveillance footage can catch a wide range of realistic anomalies. This...
research
06/26/2023

Learning Prompt-Enhanced Context Features for Weakly-Supervised Video Anomaly Detection

Video anomaly detection under weak supervision is challenging due to the...
research
11/13/2022

SCOTCH and SODA: A Transformer Video Shadow Detection Framework

Shadows in videos are difficult to detect because of the large shadow de...
research
01/19/2023

Human-Scene Network: A Novel Baseline with Self-rectifying Loss for Weakly supervised Video Anomaly Detection

Video anomaly detection in surveillance systems with only video-level la...
research
03/31/2023

Long-Short Temporal Co-Teaching for Weakly Supervised Video Anomaly Detection

Weakly supervised video anomaly detection (WS-VAD) is a challenging prob...
research
08/21/2023

TeD-SPAD: Temporal Distinctiveness for Self-supervised Privacy-preservation for video Anomaly Detection

Video anomaly detection (VAD) without human monitoring is a complex comp...

Please sign up or login with your details

Forgot password? Click here to reset