Detection Bank: An Object Detection Based Video Representation for Multimedia Event Recognition

05/28/2014
by   Tim Althoff, et al.
0

While low-level image features have proven to be effective representations for visual recognition tasks such as object recognition and scene classification, they are inadequate to capture complex semantic meaning required to solve high-level visual tasks such as multimedia event detection and recognition. Recognition or retrieval of events and activities can be improved if specific discriminative objects are detected in a video sequence. In this paper, we propose an image representation, called Detection Bank, based on the detection images from a large number of windowed object detectors where an image is represented by different statistics derived from these detections. This representation is extended to video by aggregating the key frame level image representations through mean and max pooling. We empirically show that it captures complementary information to state-of-the-art representations such as Spatial Pyramid Matching and Object Bank. These descriptors combined with our Detection Bank representation significantly outperforms any of the representations alone on TRECVID MED 2011 data.

READ FULL TEXT

page 2

page 4

research
11/14/2014

A Discriminative CNN Video Representation for Event Detection

In this paper, we propose a discriminative video representation for even...
research
12/23/2015

Mid-level Representation for Visual Recognition

Visual Recognition is one of the fundamental challenges in AI, where the...
research
03/07/2016

A novel learning-based frame pooling method for Event Detection

Detecting complex events in a large video collection crawled from video ...
research
10/10/2015

TagBook: A Semantic Video Representation without Supervision for Event Detection

We consider the problem of event detection in video for scenarios where ...
research
11/22/2020

Video SemNet: Memory-Augmented Video Semantic Network

Stories are a very compelling medium to convey ideas, experiences, socia...
research
03/02/2022

A Principled Design of Image Representation: Towards Forensic Tasks

Image forensics is a rising topic as the trustworthy multimedia content ...
research
08/26/2020

Visual Concept Reasoning Networks

A split-transform-merge strategy has been broadly used as an architectur...

Please sign up or login with your details

Forgot password? Click here to reset