Joint Max Margin and Semantic Features for Continuous Event Detection in Complex Scenes

06/13/2017
by   Iman Abbasnejad, et al.
0

In this paper the problem of complex event detection in the continuous domain (i.e. events with unknown starting and ending locations) is addressed. Existing event detection methods are limited to features that are extracted from the local spatial or spatio-temporal patches from the videos. However, this makes the model vulnerable to the events with similar concepts e.g. "Open drawer" and "Open cupboard". In this work, in order to address the aforementioned limitations we present a novel model based on the combination of semantic and temporal features extracted from video frames. We train a max-margin classifier on top of the extracted features in an adaptive framework that is able to detect the events with unknown starting and ending locations. Our model is based on the Bidirectional Region Neural Network and large margin Structural Output SVM. The generality of our model allows it to be simply applied to different labeled and unlabeled datasets. We finally test our algorithm on three challenging datasets, "UCF 101-Action Recognition", "MPII Cooking Activities" and "Hollywood", and we report state-of-the-art performance.

READ FULL TEXT

page 2

page 6

page 7

page 8

page 9

page 10

research
10/16/2020

Toward Accurate Person-level Action Recognition in Videos of Crowded Scenes

Detecting and recognizing human action in videos with crowded scenes is ...
research
06/08/2015

EventNet: A Large Scale Structured Concept Library for Complex Event Detection in Video

Event-specific concepts are the semantic concepts designed for the event...
research
05/04/2020

Slicing and dicing soccer: automatic detection of complex events from spatio-temporal data

The automatic detection of events in sport videos has important applicat...
research
07/24/2022

MAR: Masked Autoencoders for Efficient Action Recognition

Standard approaches for video recognition usually operate on the full in...
research
08/10/2020

Lane Detection Model Based on Spatio-Temporal Network with Double ConvGRUs

Lane detection is one of the indispensable and key elements of self-driv...
research
01/14/2020

Recognizing Video Events with Varying Rhythms

Recognizing Video events in long, complex videos with multiple sub-activ...
research
02/09/2018

Video Event Recognition and Anomaly Detection by Combining Gaussian Process and Hierarchical Dirichlet Process Models

In this paper, we present an unsupervised learning framework for analyzi...

Please sign up or login with your details

Forgot password? Click here to reset