Zero-Shot Crowd Behavior Recognition

08/16/2019
by   Xun Xu, et al.
4

Understanding crowd behavior in video is challenging for computer vision. There have been increasing attempts on modeling crowded scenes by introducing ever larger property ontologies (attributes) and annotating ever larger training datasets. However, in contrast to still images, manually annotating video attributes needs to consider spatiotemporal evolution which is inherently much harder and more costly. Critically, the most interesting crowd behaviors captured in surveillance videos (e.g., street fighting, flash mobs) are either rare, thus have few examples for model training, or unseen previously. Existing crowd analysis techniques are not readily scalable to recognize novel (unseen) crowd behaviors. To address this problem, we investigate and develop methods for recognizing visual crowd behavioral attributes without any training samples, i.e., zero-shot learning crowd behavior recognition. To that end, we relax the common assumption that each individual crowd video instance is only associated with a single crowd attribute. Instead, our model learns to jointly recognize multiple crowd behavioral attributes in each video instance by exploring multiattribute cooccurrence as contextual knowledge for optimizing individual crowd attribute recognition. Joint multilabel attribute prediction in zero-shot learning is inherently nontrivial because cooccurrence statistics does not exist for unseen attributes. To solve this problem, we learn to predict cross-attribute cooccurrence from both online text corpus and multilabel annotation of videos with known attributes. Our experiments show that this approach to modeling multiattribute context not only improves zero-shot crowd behavior recognition on the WWW crowd video dataset, but also generalizes to novel behavior (violence) detection cross-domain in the Violence Flow video dataset.

READ FULL TEXT

page 3

page 14

page 20

page 21

page 22

page 26

research
01/15/2022

Towards Zero-shot Sign Language Recognition

This paper tackles the problem of zero-shot sign language recognition (Z...
research
09/15/2014

Zero Shot Recognition with Unreliable Attributes

In principle, zero-shot learning makes it possible to train a recognitio...
research
09/22/2017

Context Embedding Networks

Low dimensional embeddings that capture the main variations of interest ...
research
06/07/2018

Probabilistic AND-OR Attribute Grouping for Zero-Shot Learning

In zero-shot learning (ZSL), a classifier is trained to recognize visual...
research
12/11/2018

Zero-Shot Learning with Sparse Attribute Propagation

Zero-shot learning (ZSL) aims to recognize a set of unseen classes witho...
research
07/29/2017

Zero-Shot Activity Recognition with Verb Attribute Induction

In this paper, we investigate large-scale zero-shot activity recognition...
research
12/08/2019

Zero-shot Recognition of Complex Action Sequences

Zero-shot video classification for fine-grained activity recognition has...

Please sign up or login with your details

Forgot password? Click here to reset