Weakly-supervised Video Anomaly Detection with Contrastive Learning of Long and Short-range Temporal Features

01/25/2021
by   Yu Tian, et al.
12

In this paper, we address the problem of weakly-supervised video anomaly detection, in which given video-level labels for training, we aim to identify in test videos, the snippets containing abnormal events. Although current methods based on multiple instance learning (MIL) show effective detection performance, they ignore important video temporal dependencies. Also, the number of abnormal snippets can vary per anomaly video, which complicates the training process of MIL-based methods because they tend to focus on the most abnormal snippet – this can cause it to mistakenly select a normal snippet instead of an abnormal snippet, and also to fail to select all abnormal snippets available. We propose a novel method, named Multi-scale Temporal Network trained with top-K Contrastive Multiple Instance Learning (MTN-KMIL), to address the issues above. The main contributions of MTN-KMIL are: 1) a novel synthesis of a pyramid of dilated convolutions and a self-attention mechanism, with the former capturing the multi-scale short-range temporal dependencies between snippets and the latter capturing long-range temporal dependencies; and 2) a novel contrastive MIL learning method that enforces large margins between the top-K normal and abnormal video snippets at the feature representation level and anomaly score level, resulting in accurate anomaly discrimination. Extensive experiments show that our method outperforms several state-of-the-art methods by a large margin on three benchmark data sets (ShanghaiTech, UCF-Crime and XD-Violence). The code is available at https://github.com/tianyu0207/MTN-KMIL

READ FULL TEXT
research
06/03/2022

Anomaly detection in surveillance videos using transformer based attention model

Surveillance footage can catch a wide range of realistic anomalies. This...
research
12/09/2022

CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection

Video anomaly detection (VAD) – commonly formulated as a multiple-instan...
research
02/10/2023

Dual Memory Units with Uncertainty Regulation for Weakly Supervised Video Anomaly Detection

Learning discriminative features for effectively separating abnormal eve...
research
09/14/2022

Real-world Video Anomaly Detection by Extracting Salient Features in Videos

We propose a lightweight and accurate method for detecting anomalies in ...
research
03/22/2023

Unbiased Multiple Instance Learning for Weakly Supervised Video Anomaly Detection

Weakly Supervised Video Anomaly Detection (WSVAD) is challenging because...
research
07/09/2020

Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision

Violence detection has been studied in computer vision for years. Howeve...
research
03/31/2023

Long-Short Temporal Co-Teaching for Weakly Supervised Video Anomaly Detection

Weakly supervised video anomaly detection (WS-VAD) is a challenging prob...

Please sign up or login with your details

Forgot password? Click here to reset